(51) 



(21) International Application Number: PCT/US99/025 1 1 

(22) International Filing Date: 5 February 1999 (05.02.99) 

(30) Priority Data: 

09/019,070 5 February 1998 (05.02.98) US 

09/124,180 28 July 1998 (28.07.98) US 

(71) Applicant: THE JOHNS HOPKINS UNIVERSITY SCHOOL 

OF MEDICINE [US/US]; 720 Rutland Avenue, Baltimore, 
MD 21205 (US). 

(72) Inventors: LEE, Se-Jin; 6711 Chokeberry Road, Baltimore, 

MD 21209 (US). MCPHERRON, Alexandra, C; 519 W. 
Pratt Street #218, Baltimore, MD 21201 (US). 



i.99) 



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR, 
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GD, 
GE, GH, GM, HR, HU, ID, IL, IN, IS, JP. KE, KG, KP, 
KR, KZ, LC. LK, LR, LS, LT. LU, LV, MD, MG, MK, 
MN, MW, MX, NO, NZ, PL, PT, RO, RU, SD, SE f SG, 
SI, SK, SL, TJ, TM, TR, TT, UA, UG, UZ, VN, YU, ZW, 
ARIPO patent (GH, GM, KE, LS, MW, SD, SZ, UG. ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, 
GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI patent (BF, 
BJ, CF, CG, CI. CM, GA, GN, GW, ML, MR, NE, SN, 
TD, TG). 



Published 

With international search report. 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



International Patent Classification 6 : 

C12N 5/00, 15/00, 15/09, 15/63, G01N 
33/00, A61K 39/395, 48/00 



Al 



(11) International Publication Number: WO 99/401 

(43) International Publication Date: 12 August 1999 (12.08 



(74) Agent: HAILE, Lisa, A.; Fish & Richardson P.C., Suite 1400, 
4225 Executive Square, La Jolla, CA 92037 (US). 



(54) Title: GROWTH DIFFERENTIATION FACTOR-8 



(57) Abstract 



A transgenic non-human animal of the species selected from the group consisting of avian, bovine, ovine and porcine having a 
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GROWTH DIFFERENTIATION FACTOR-8 

BACKGROUND OF THE INVENTION 

5 1 . Field of the Invention 

The invention relates generally to growth factors and specifically to a new member of the 
transforming growth factor beta (TGF-P) superfamily, which is denoted, growth 
differentiation factor-8 (GDF-8) and methods of use for modulating muscle, bone, kidney 
and adipose cell and tissue growth. 

10 2. Description of Related Art 

The transforming growth factor p (TGF-P) superfamily encompasses a group of 
structurally-related proteins which affect a wide range of differentiation processes during 
embryonic development. The family includes, Mullerian inhibiting substance (MIS), 
which is required for normal male sex development (Behringer, et al t Nature , 345 :167. 

15 1990), Drosophila decapentaplegic (DPP) gene product, which is required for 
dorsal- ventral axis formation and morphogenesis of the imaginal disks (Padgett, et ai, 
Nature, 325:81 -84, 1987), the Xenopus Vg-1 gene product, which localizes to the 
vegetal pole of eggs ((Weeks, et al„ Cell, 51:861-867, 1987), the activins (Mason, et ai, 
Biochem, Biophys, Res. Commun., 135:957-964, 1986), which can induce the formation 

20 of mesoderm and anterior structures in Xenopus embryos (Thomsen, et al t Cell, 63:485, 
1990), and the bone morphogenetic proteins (BMPs, osteogenin, OP-1) which can induce 
de novo cartilage and bone formation (Sampath, et al, J. Biol Chem.* 265 :13198, 1990). 
The TGF-Ps can influence a variety of differentiation processes, including adipogenesis, 
myogenesis, chondrogenesis, hematopolesis, and epithelial cell differentiation (for 

25 review, see Massague, Cell 49:437, 1987). 
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The proteins of the TGF-P family are initially synthesized as a large precursor protein 
which subsequently undergoes proteolytic cleavage at a cluster of basic residues 
approximately 110-140 amino acids from the C-terminus. The C-terminal regions, or 
mature regions, of the proteins are all structurally related and the different family 
5 members can be classified into distinct subgroups based on the extent of their homology. 
Although the homologies within particular subgroups range from 70% to 90% amino 
acid sequence identity, the homologies between subgroups are significantly lower, 
generally ranging from only 20% to 50%. In each case, the active species appears to be 
adisulfide-linked dimer of C-terminal fragments. Studies have shown that when the pro- 
10 region of a member of the TGF-p family is coexpressed with a mature region of another 
member of the TGF-p family, intracellular dimerization and secretion of biologically 
active homodimers occur (Gray, A. etal, Science, 247: 1328, 1990). Additional studies 
by Hammonds, et al, (Molec. Endocrin. 5:149, 1991) showed that the use of the BMP-2 
pro-region combined with the BMP-4 mature region led to dramatically improved 
1 5 expression of mature BMP-4. For most of the family members that have been studied, 
the homodimeric species has been found to be biologically active, but for other family 
members, like the inhibins (Ling, et al, Nature, 321 :?79, 1986) and the TGF-p S 
(Cheifetz, et al., Cell, 48:409, 1987), heterodimers have also been detected, and these 
appear to have different biological properties than the respective homodimers. 



20 



In addition it is desirable to produce livestock and game animals, such as cows, sheep, 
pigs, chicken and turkey, fish which are relatively high in musculature and protein, and 
low in fat content. Many drug and diet regimens exist which may help increase muscle 
and protein content and lower undesirably high fat and/or cholesterol levels, but such 
25 treatment is generally administered after the fact, and is begun only after significant 
damage has occurred to the vasculature. Accordingly, it would be desirable to produce 
animals which are genetically predisposed to having higher muscle and/or bone content, 
without any ancillary increase in fat levels. 
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The food industry has put much effort into increasing the amount of muscle and protein 
in foodstuffs. This quest is relatively simple in the manufacture of synthetic foodstuffs, 
but has been met with limited success in the preparation of animal foodstuffs. Attempts 
have been made, for example, to lower cholesterol levels in beef and poultry products by 
5 including cholesterol-lowering drugs in animal feed (see e.g. Elkin and Rogler, J. Agric. 
Food Chem. 1990, 38, 1635-1641). However, there remains a need for more effective 
methods of increasing muscle and reducing fat and cholesterol levels in animal food 
products. 

SUMMARY OF THE INVENTION 

10 The present invention provides a cell growth and differentiation factor, GDF-8, a 
polynucleotide sequence which encodes the factor, and antibodies which are immunore- 
active with the factor. This factor appears to relate to various cell proliferative disorders, 
especially those involving muscle, nerve, bone, kidney and adipose tissue. 

In one embodiment, the invention provides a method for detecting a cell proliferative 
1 5 disorder of muscle, nerve, bone, kidney or fat origin and which is associated with GDF-8. 
In another embodiment, the invention provides a method for treating a cell proliferative 
disorder by suppressing or enhancing GDF-8 activity. 

In another embodiment, the subject invention provides non-human transgenic animals 
which are useful as a source of food products with high muscle, bone and protein content, 

20 and reduced fat and cholesterol content. The animals have been altered chromosomally 
in their germ cells and somatic cells so that the production of GDF-8 is produced in 
reduced amounts, or is completely disrupted, resulting in animals with decreased levels 
of GDF-8 in their system and higher than normal levels of muscle tissue and bone tissue, 
such as ribs, preferably without increased fat and/or cholesterol levels. Accordingly, the 

25 present invention also includes food products provided by the animals. Such food 
products have increased nutritional value because of the increase in muscle tissue and 
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bone content. The transgenic non-human animals of the invention include bovine, 
porcine, ovine and avian animals, for example. 

The subject invention also provides a method of producing animal food products having 
increased bone content. The method includes modifying the genetic makeup of the germ 

5 cells of a pronuclear embryo of the animal, implanting the embryo into the oviduct of a 
pseudopregnant female thereby allowing the embryo to mature to full term progeny, 
testing the progeny for presence of the transgene to identify transgene-positive progeny, 
cross-breeding transgene-positive progeny to obtain further transgene-positive progeny 
and processing the progeny to obtain foodstuff. The modification of the germ cell 

1 0 comprises altering the genetic composition so as to disrupt or reduce the expression of 
the naturally occurring gene encoding for production of GDF-8 protein. In a particular 
embodiment, the transgene comprises antisense polynucleotide sequences to the GDF-8 
protein. Alternatively, the transgene may comprise a non-functional sequence which 
replaces or intervenes in the native GDF-8 gene. 

1 5 The subject invention also provides a method of producing avian food products having 
improved muscle and/or bone content. The method includes modifying the genetic 
makeup of the germ cells of a pronuclear embryo of the avian animal, implanting the 
embryo into the oviduct of a pseudopregnant female into an embryo of a chicken, 
culturing the embryo under conditions whereby progeny are hatched, testing the progeny 

20 for presence of the genetic alteration to identify transgene-positive progeny, cross- 
breeding transgene-positive progeny and processing the progeny to obtain foodstuff. 

The invention also provides a method for treating a muscle, bone, kidney or adipose 
tissue disorder in a subject. The method includes administering a therapeutically 
effective amount of a GDF-8 agent to the subject, thereby inhibiting abnormal growth 
25 of muscle, bone or adipose tissue. The GDF-8 agent may include an antibody, a GDF-8 
antisense molecule or a dominant negative polypeptide, for example. In one aspect, a 
method for inhibiting the growth regulating actions of GDF-8 by contacting an anti- 
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GDF-8 monoclonal antibody, a GDF-8 antisense molecule or a dominant negative 
polypeptide (or polynucleotide encoding a dominant negative polypeptide) with fetal or 
adult muscle cells, bone cells or progenitor cells is included. These agents can be 
administered to a patient suffering from a disorder such as muscle wasting disease, 
5 neuromuscular disorder, muscle atrophy, osteoporosis, bone degenerative diseases, 
obesity or other adipocyte cell disorders, and aging, for example. In another aspect of 
the invention, the agent may be an agonist of GDF-8 activity. In this embodiment, the 
agonist may be administered to promote kidney cell growth and differentiation in kidney 
tissue. 

10 The invention also provides a method for identifying a compound that affects GDF-8 
activity or gene expression including incubating the compound with GDF-8 polypeptide, 
or with a recombinant cell expressing GDF-8 under conditions sufficient to allow the 
compounds to interact and determining the effect of the compound on GDF-8 activity or 
expression. 

1 5 BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE la is a Northern blot showing expression of GDF-8 mRNA in adult tissues. The 
probe was a partial murine GDF-8 clone. 

FIGURE lb is a Southern blot showing GDF-8 genomic sequences identified in mouse, 
rat, human, monkey, rabbit, cow, pig, dog and chicken. 

20 FIGURE 2 shows partial nucleotide and predicted amino acid sequences of murine 
GDF-8 (FIGURE 2a; SEQ ID NO:l 1 and 12, respectively), human GDF-8 (FIGURE 2b; 
SEQ ID NO: 13 and 14, respectively), rat GDF-8 (FIGURE 2c; SEQ ID NO: 24 and 25, 
respectively) and chicken GDF-8 (FIGURE 2d; SEQ ID NO: 22 and 23, respectively). 
The putative dibasic processing sites in the murine sequence are boxed. 
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FIGURE 3a shows the alignment of the C-terminal sequences of GDF-8 with other 
members of the TGF-p superfamily. The conserved cysteine residues are boxed. Dashes 
denote gaps introduced in order to maximize alignment. 

FIGURE 3b shows the alignment of the C-terminal sequences of GDF-8 from human, 
5 murine, rat and chicken sequences. 

FIGURE 4 shows amino acid homologies among different members of the TGF 
superfamily. Numbers represent percent amino acid identities between each pair 
calculated from the first conserved cysteine to the C-terminus. Boxes represent 
homologies among highly-related members within particular subgroups. 

10 FIGURE 5 shows the sequence of GDF-8. Nucleotide and amino acid sequences of 
murine (FIGURE 5a and 5b)(GenBank accession number U84005; SEQ ID NO: 1 1 and 
12, respectively) and human (FIGURE 5c and 5d; SEQ ID NO:13 and 14, respectively) 
GDF-8 cDNA clones are shown. Numbers indicate nucleotide position relative to the 5' 
end. Consensus N-linked glycosylation signals are shaded. The putative RXXR 

15 proteolytic cleavage sites are boxed. 

FIGURE 6 shows a hydropathicity profile of GDF-8. Average hydrophobicity values for 
murine (FIGURE 6a) and human (FIGURE 6b) GDF-8 were calculated using the method 
of Kyte and Doolittle (J. Mol. Biol, 157:105-132, 1982). Positive numbers indicate 
increasing hydrophobicity. 

20 FIGURE 7 shows a comparison of murine and human GDF-8 amino acid sequences. The 
predicted murine sequence is shown in the top lines and the predicted human sequence 
is shown in the bottom lines. Numbers indicate amino acid position relative to the 
N-terminus. Identities between the two sequences are denoted by a vertical line. 
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FIGURE 8 shows the expression of GDF-8 in bacteria. BL21 (DE3) (pLysS) cells 
carrying a pRSET/GDF-8 expression plasmid were induced with 
isopropylthio-P-galactoside, and the GDF-8 fusion protein was purified by metal chelate 
chromatography. Lanes: total=total cell lysate; soluble^soluble protein fraction; 
5 insoluble=insoluble protein fraction (resuspended in 10 Mm Tris pH 8.0, 50 mM sodium 
phosphate, 8 M urea, and 10 mM P-mercaptoethanol [buffer B]) loaded onto the column, 
pellet=insoluble protein fraction discarded before loading the column; 
flowthrough^proteins not bound by the column; washes=washes carried out in buffer B 
at the indicated pH's. Positions of molecular weight standards are shown at the right. 
10 Arrow indicates the position of the GDF-8 fusion protein. 

FIGURE 9 shows the expression of GDF-8 in mammalian cells. Chinese hamster ovary 
cells were transfected with pMSXND/GDF-8 expression plasmids and selected in G41 8. 
Conditioned media from G4 18 -resistant cells (prepared from cells transfected with 
constructs in which GDF-8 was cloned in either the antisense or sense orientation) were 
15 concentrated, electrophoresed under reducing conditions, blotted, and probed with 
anti-GDF-8 antibodies and [ 125 l]iodoproteinA. Arrow indicates the position of the 
processed GDF-8 protein. 

FIGURE 10 shows the expression of GDF-8 mRNA. Poly A-selected RNA (5|ig each) 
prepared from adult tissues (FIGURE 10a) or placentas end embryos (FIGURE 10b) at 
20 the indicated days of gestation was electrophoresed on formaldehyde gels, blotted, and 
probed with full length murine GDF-8. 

FIGURE 11 shows chromosomal mapping of human GDF-8. DNA samples prepared 
from human/rodent somatic cell hybrid lines were subjected to PCR, electrophoresed on 
agarose gels, blotted, and probed. The human chromosome contained in each of the 
25 hybrid cell lines is identified at the top of each of the first 24 lanes (1-22, X, and Y). In 
the lanes designated M, CHO, and H, the starting DNA template was total genomic DNA 
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from mouse, hamster, and human sources, respectively. In the lane marked Bl, no 
template DNA was used. Numbers at left indicate the mobilities of DNA standards. 

Figure 12a shows a map of the GDF-8 locus (top line) and targeting construct (second 
line). The black and stippled boxes represent coding sequences for the pro- and 
5 C-terminal regions, respectively. The white boxes represent 5' and 3' untranslated 
sequences. A probe derived from the region downstream of the 3' homology fragment 
and upstream of the most distal HindlH site shown hybridizes to an 1 1.2 kb Hindm 
fragment in the GDF-8 gene and a 10.4 kb fragment in an homologously targeted gene. 
Abbreviations: H, Hindlll; X, Xba I. 

10 Figure 12b shows a Southern blot analysis of offspring derived from a mating of 
heterozygous mutant mice. The lanes are as follows: DNA prepared from wild type 129 
SV/J mice (lane 1), targeted embryonic stem cells (lane 2), Fl heterozygous mice (lanes 
3 and 4), and offspring derived from a mating of these mice (lanes 5-13). 

Figure 13 shows the muscle fiber size distribution in mutant and wild type littermates. 

15 Figure 13a shows the smallest cross-sectional fiber widths measured for wild type (n = 
1761) and mutant (n = 1052) tibialis cranial. Figure 13b shows wild type (n = 900) and 
mutant (n = 900) gastrocnemius muscles, and fiber sizes were plotted as a percent of total 
fiber number. Standard deviations were 9 and 10 jon, respectively, for wild type and 
mutant tibialis cranial is and 11 and 9 /xm, respectively, for wild type and mutant 

20 gastrocnemius muscles. Legend: o-o, wild type; mutant. 

Figure 14a shows the nucleotide and deduced amino acid sequence for baboon GDF-8 
(SEQ ID NO: 1 8 and 1 9, respectively). 

Figure 14b shows the nucleotide and deduced amino acid sequence for bovine GDF-8 
(SEQ ID NO: 20 and 21, respectively). 

25 Figure 14c shows the nucleotide and deduced amino acid sequence for chicken GDF-8 
(SEQ ID NO:22 and 23, respectively). 
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Figure 14d shows the nucleotide and deduced amino acid sequence for rat GDF-8 (SEQ 
ID NO:24 and 25, respectively). 

Figure He shows the nucleotide and deduced amino acid sequence for turkey GDF-8 
(SEQ ID NO:26 and 27, respectively). 

5 Figure 1 4f shows the nucleotide and deduced amino acid sequence for porcine GDF-8 
(SEQ ID NO:28 and 29, respectively). 

Figure 1 4g shows the nucleotide and deduced amino acid sequence for ovine GDF-8 
(SEQ ID NO:30 and 31, respectively). 

Figures 15a and 15b show an alignment between murine, rat, human, porcine, ovine, 
10 baboon, bovine, chicken, and turkey GDF-8 amino acid sequences (SEQ ID NO: 12, 25, 
14, 29, 3 1, 19, 21, 23 and 27, respectively). 

Figure 16 shows the predicted amino acid sequences of murine and human GDF-11 
aligned with murine (McPherron et al., 1997) and human (McPherron and Lee, 1997) 
myostatin (MSTN). Shaded boxes represent amino acid homology with the murine and 
15 human GDF-1 1 sequences. Amino acids are numbered relative to the human GDF-1 1 
sequence. The predicted proteolytic processing sites are located at amino acids 295-298. 

Figure 17 shows the construction of GDF-1 1 null mice by homologous targeting, a) is 
a map of the GDF-1 1 locus (top line) and targeting construct (second line). The black 
and stippled boxes represent coding sequences for the pro-and C-terminal regions, 
20 respectively. The targeting construct contains a total of 1 1 kb of homology with the 
GDF-1 1 gene. A probe derived from the region upstream of the 3' homology fragment 
and downstream of the first EcoRI site shown hybridizes to a 6.5 kb EcoRl fragment in 
the GDF-1 1 gene and a 4.8 kb fragment in a homologously targeted gene. Abbrevia- 
tions: X, Xbal; E, EcoRl. b) Geneomic Southern of DNA prepared from Fl heterozy- 
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gous mutant mice (lanes 1 and 2) and offspring derived from a mating of these mice 
(lanes 3-12). 

Figure 18 shows kidney abnormalities in GDF-1 1 knockout mice. Kidneys of newborn 
animals were examined and classified according to the number of normal sized or small 
5 kidneys as shown at the top. Numbers in the table indicate number of animals falling 
into each classification according to genotype. 

Figure 19 shows homeotic transformations in GDF-1 1 mutant mice, a) Newborn pups 
with missing (first and second from left) and normal looking tails, b-j) Skeleton 
preparations for newborn wild-type (b, e, h), heterozygous (c, f, I) and homozygous (d, 
10 g, j) mutant mice. Whole skeleton preparations (b-d), vertebral columns (e-g), 
vertebrosternal ribs (h-j) showing transformations and defects in homozygous and 
heterozygous mutant mice. Numbers indicate thoracic segments. 

Figure 20 is a table summarizing the anterior transformations in wild-type, heterozygous 
and homozygous GDF-1 1 mice. 

1 5 DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a growth and differentiation factor, GDF-8 and a 
polynucleotide sequence encoding GDF-8. GDF-8 is expressed at highest levels in 
muscle and at lower levels in adipose tissue. 

The animals contemplated for use in the practice of the subject invention are those 
20 animals generally regarded as useful for the processing of food stuffs, i.e. avian such as 
meat bred and egg laying chicken and turkey, ovine such as lamb, bovine such as beef 
cattle and milk cows, piscine and porcine. For purposes of the subject invention, these 
animals are referred to as "transgenic" when such animal has had a heterologous DNA 
sequence, or one or more additional DNA sequences normally endogenous to the animal 
25 (collectively referred to herein as "transgenes") chromosomally integrated into the germ 
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cells of the animal. The transgenic animal (including its progeny) will also have the 
transgene integrated into the chromosomes of somatic cells. 

The TGF-p superfamily consists of multifunctional polypeptides that control prolifera- 
tion, differentiation, and other functions in many cell types. Many of the peptides have 
5 regulatory, both positive and negative, effects on other peptide growth factors. The 
structural homology between the GDF-8 protein of this invention and the members of the 
TGF-P family, indicates that GDF-8 is a new member of the family of growth and 
differentiation factors. Based on the known activities of many of the other members, it 
can be expected that GDF-8 will also possess biological activities that will make it useful 
10 as a diagnostic and therapeutic reagent. 

In particular, certain members of this superfamily have expression patterns or possess 
activities that relate to the function of the nervous system. For example, the inhibins and 
activins have been shown to be expressed in the brain (Meunier, et al, Proc. Natl. Acad. 
ScL, USA, 85:247, 1988; Sawchenko, et al, Nature, 334:615, 1988), and activin has been 

15 shown to be capable of functioning as a nerve cell survival molecule (Schubert, et al, 
Nature, 344:868, 1990). Another family member, namely, GDF-1, is nervous sys- 
tem-specific in its expression pattern (Lee, S J., Proc, Natl Acad. ScL, USA, 88:4250, 
1991), and certain other family members, such as Vgr-1 (Lyons, et ai, Proc. Natl Acad. 
Sci., USA, 86:4554, 1989; Jones, et al t Development, i]J_:531, 1991), OP-1 (Ozkaynak, 

20 et al, J. Biol Chem., 267:25220, 1992), and BMP-4 (Jones, et al t Development, 
111 :531, 1991), are also known to be expressed in the nervous system. Because it is 
known that skeletal muscle produces a factor or factors that promote the survival of 
motor neurons (Brown, Trends NeuroscL, 7:10, 1984), the expression of GDF-8 in 
muscle suggests that one activity of GDF-8 may be as a trophic factor for neurons. In this 

25 regard, GDF-8 may have applications in the treatment of neurodegenerative diseases, 
such as amyotrophic lateral sclerosis or muscular dystrophy, or in maintaining cells or 
tissues in culture prior to transplantation. 
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GDF-8 may also have applications in treating disease processes involving the 
musculoskeletal system, such as in musculodegenerative diseases, osteoporosis or in 
tissue repair due to trauma. In this regard, many other members of the TGF-p family are 
also important mediators of tissue repair. TGF-p has been shown to have marked effects 
5 on the formation of collagen and to cause a striking angiogenic response in the newborn 
mouse (Roberts, et al, Proc. Natl Acad. Sci., USA 83:4167, 1986). TGF-p has also been 
shown to inhibit the differentiation of myoblasts in culture (Massague, et al, Proc. Natl 
Acad. Sci. t USA 83:8206, 1986). Moreover, because myoblast cells may be used as a 
vehicle for delivering genes to muscle for gene therapy, the properties of GDF-8 could 
10 be exploited for maintaining cells prior to transplantation or for enhancing the efficiency 
of the fusion. GDF-8 may also have applications in treating disease processes involving 
the kidney or in kidney repair due to trauma. 

The expression of GDF-8 in adipose tissue also raises the possibility of applications for 
GDF-8 in the treatment of obesity or of disorders related to abnormal proliferation of 
15 adipocytes. In this regard, TGF-p has been shown to be a potent inhibitor of adipocyte 
differentiation in vitro (Ignotz and Massague, Proc. Natl Acad. ScL t USA 82:8530, 
1985). 

Polypeptides. Polynucleotides. Vectors and Host Cells 

The invention provides substantially pure GDF-8 polypeptide and isolated polynucleo- 
20 tides that encode GDF-8. The term "substantially pure" as used herein refers to GDF-8 
which is substantially free of other proteins, lipids, carbohydrates or other materials with 
which it is naturally associated. One skilled in the art can purify GDF-8 using standard 
techniques for protein purification. The substantially pure polypeptide will yield a single 
major band on a non-reducing polyacrylamide gel. The purity of the GDF-8 polypeptide 
25 can also be determined by amino-terminal amino acid sequence analysis. GDF-8 
polypeptide includes functional fragments of the polypeptide, as long as the activity of 
GDF-8 remains. Smaller peptides containing the biological activity of GDF-8 are 
included in the invention. 
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The invention provides polynucleotides encoding the GDF-8 protein. These polynucleo- 
tides include DNA, cDNA and RNA sequences which encode GDF-8. It is understood 
that all polynucleotides encoding all or a portion of GDF-8 are also included herein, as 
long as they encode a polypeptide with GDF-8 activity. Such polynucleotides include 
5 naturally occurring, synthetic, and intentionally manipulated polynucleotides. For 
example, GDF-8 polynucleotide may be subjected to site-directed mutagenesis. The 
polynucleotide sequence for GDF8 also includes antisense sequences. The polynucleo- 
tides of the invention include sequences that are degenerate as a result of the genetic 
code. There are 20 natural amino acids, most of which are specified by more than one 
10 codon. Therefore, all degenerate nucleotide sequences are included in the invention as 
long as the amino acid sequence of GDF-8 polypeptide encoded by the nucleotide 
sequence is functionally unchanged. 

Specifically disclosed herein is a genomic DNA sequence containing a portion of the 
GDF-8 gene. The sequence contains an open reading frame corresponding to the 

15 predicted C-terminal region of the GDF-8 precursor protein. The encoded polypeptide 
is predicted to contain two potential proteolytic processing sites (KR and RR). Cleavage 
of the precursor at the downstream site would generate a mature biologically active 
C-terminal fragment of 109 and 103 amino acids for murine and human species, 
respectively, with a predicted molecular weight of approximately 12,400. Also disclosed 

20 are full length murine and human GDF-8 cDNA sequences. The murine pre-pro-GDF-8 
protein is 376 amino acids in length, which is encoded by a 2676 base pair nucleotide 
sequence, beginning at nucleotide 1 04 and extending to a TG A stop codon at nucleotide 
1232. The human GDF-8 protein is 375 amino acids and is encoded by a 2743 base pair 
sequence, with the open reading frame beginning at nucleotide 59 and extending to 

25 nucleotide 1 184. GDF-8 is also capable of forming dimers, or heterodimers, with an 
expected molecular weight of approximately 23-30KD (see Example 4). For example, 
GDF-8 may form heterodimers with other family members, such as GDF-1 1. 

Also provided herein are the biologically active C-terminal fragments of chicken (Figure 
2c) and rat (Figure 2d) GDF-8. The full length nucleotide and deduced amino acid 
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sequences for baboon, bovine, chicken, rat, ovine, porcine, and turkey are shown in 
Figures 14a-g and human and murine are shown in Figure 5. As shown in Figure 3b, 
alignment of the amino acid sequences of human, murine, rat and chicken GDF-8 
indicate that the sequences are 100% identical in the C-terminal biologically active 

5 fragment. Figure 1 5 a and 1 5b also show the alignment of GDF-8 amino acid sequences 
for murine, rat, human, baboon, porcine, ovine, bovine, chicken and turkey. Given the 
extensive conservation of amino acid sequences between species, it would now be 
routine for one of skill in the art to obtain the GDF-8 nucleic acid and amino acid 
sequence for GDF-8 from any species, including those provided herein, as well as 

10 piscine, for example. 

The C-teiminal region of GDF-8 following the putative proteolytic processing site shows 
significant homology to the known members of the TGF-p superfamily. The GDF-8 
sequence contains most of the residues that are highly conserved in other family 
members and in other species (see FIGURES 3a and 3b and 15 a and 15b). Like the 
15 TGF-ps and inhibin Ps, GDF-8 contains an extra pair of cysteine residues in addition to 
the 7 cysteines found in virtually all other family members. Among the known family 
members, GDF-8 is most homologous to Vgr-1 (45% sequence identity) (see FIGURE 
4). 

Minor modifications of the recombinant GDF-8 primary amino acid sequence may result 
20 in proteins which have substantially equivalent activity as compared to the GDF-8 
polypeptide described herein. Such modifications may be deliberate, as by site-directed 
mutagenesis, or may be spontaneous. All of the polypeptides produced by these 
modifications are included herein as long as the biological activity of GDF-8 still exists. 
Further, deletion of one or more amino acids can also result in a modification of the 
25 structure of the resultant molecule without significantly altering its biological activity. 
This can lead to the development of a smaller active molecule which would have broader 
utility. For example, one can remove amino or carboxy terminal amino acids which are 
not required for GDF-8 biological activity. 
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The nucleotide sequence encoding the GDF-8 polypeptide of the invention includes the 
disclosed sequence and conservative variations thereof. The term "conservative 
variation" as used herein denotes the replacement of an amino acid residue by another, 
biologically similar residue. Examples of conservative variations include the substitution 
5 of one hydrophobic residue such as isoleucine, valine, leucine or methionine for another, 
or the substitution of one polar residue for another, such as the substitution of arginine 
for lysine, glutamic for aspartic acid, or glutamine for asparagine, and the like. The term 
"conservative variation" also includes the use of a substituted amino acid in place of an 
unsubstituted parent amino acid provided that antibodies raised to the substituted 
10 polypeptide also immuno react with the unsubstituted polypeptide. 

DNA sequences of the invention can be obtained by several methods. For example, the 
DNA can be isolated using hybridization techniques which are well known in the art. 
These include, but are not limited to: 1) hybridization of genomic or cDNA libraries with 
probes to detect homologous nucleotide sequences, 2) polymerase chain reaction (PCR) 
1 5 on genomic DNA or cDN A using primers capable of annealing to the DNA sequence of 
interest, and 3) antibody screening of expression libraries to detect cloned DNA 
fragments with shared structural features. 

Preferably the GDF-8 polynucleotide of the invention is derived from a mammalian 
organism, and most preferably from mouse, rat, cow, pig, or human. GDF-8 polynucleo- 

20 tides from chicken, turkey, fish and other species are also included herein. Screening 
procedures which rely on nucleic acid hybridization make it possible to isolate any gene 
sequence from any organism, provided the appropriate probe is available. Given the 
extensive nucleotide and amino acid homology between species, it would be routine for 
one of skill in the art to obtain polynucleotides encoding GDF-8 from any species. - 

25 Oligonucleotide probes, which correspond to a part of the sequence encoding the protein 
in question, can be synthesized chemically. This requires that short, oligopeptide 
stretches of amino acid sequence must be known. The DNA sequence encoding the 
protein can be deduced from the genetic code, however, the degeneracy of the code must 
be taken into account. It is possible to perform a mixed addition reaction when the 
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sequence is degenerate. This includes a heterogeneous mixture of denatured dou- 
ble-stranded DNA. For such screening, hybridization is preferably performed on either 
single-stranded DNA or denatured double-stranded DNA. Hybridization is particularly 
useful in the detection of cDNA clones derived from sources where an extremely low 
5 amount of mRNA sequences relating to the polypeptide of interest are present. In other 
words, by using stringent hybridization conditions directed to avoid non-specific binding, 
it is possible, for example, to allow the autoradiographic visualization of a specific cDNA 
clone by the hybridization of the target DNA to that single probe in the mixture which 
is its complete complement (Wallace, et al, Nucl. Acid Res. 9:879, 1981). 

10 The development of specific DNA sequences encoding GDF-8 can also be obtained by: 
1 ) isolation of double-stranded DNA sequences from the genomic DNA; 2) chemical 
manufacture of a DNA sequence to provide the necessary codons for the polypeptide of 
interest; and 3) in vitro synthesis of a doublestranded DNA sequence by reverse 
transcription of mRNA isolated from a eukaryotic donor cell. In the latter case, a 

15 double-stranded DNA complement of mRNA is eventually formed which is generally 
referred to as cDNA. 

Of the three above-noted methods for developing specific DNA sequences for use in 
recombinant procedures, the isolation of genomic DNA isolates is the least common. 
This is especially true when it is desirable to obtain the microbial expression of 
20 mammalian polypeptides due to the presence of introns. 

The synthesis of DNA sequences is frequently the method of choice when the entire 
sequence of amino acid residues of the desired polypeptide product is known. When the 
entire sequence of amino acid residues of the desired polypeptide is not known, the direct 
synthesis of DNA sequences is not possible and the method of choice is the synthesis of 
25 cDNA sequences. Among the standard procedures for isolating cDNA sequences of 
interest is the formation of plasmid- or phage^arrying cDNA libraries which are derived 
from reverse transcription of mRNA which is abundant in donor cells that have a high 
level of genetic expression. When used in combination with polymerase chain reaction 
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technology, even rare expression products can be cloned. In those cases where significant 
portions of the amino acid sequence of the polypeptide are known, the production of 
labeled single or double-stranded DN A or RNA probe sequences duplicating a sequence 
putatively present in the target cDNA may be employed in DNA/DNA hybridization 
5 procedures which are carried out on cloned copies of the cDNA which have been 
denatured into a single-stranded form (Jay, et aL, Nucl Acid Res. y J_l:2325, 1983). 

A cDNA expression library, such as lambda gtl 1, can be screened indirectly for GDF-8 
peptides having at least one epitope, using antibodies specific for GDF-8. Such 
antibodies can be either polyclonally or monoclonally derived and used to detect 
10 expression product indicative of the presence of GDF-8 cDNA. 

In nucleic acid hybridization reactions, the conditions used to achieve a particular level 
of stringency will vary, depending on the nature of the nucleic acids being hybridized. 
For example, the length, degree of complementarity, nucleotide sequence composition 
(e.g., GC v. AT content), and nucleic acid type (e.g., RNA v. DNA) of the hybridizing 
15 regions of the nucleic acids can be considered in selecting hybridization conditions. An 
additional consideration is whether one of the nucleic acids is immobilized, for example, 
on a filter. 

An example of progressively higher stringency conditions is as follows: 2 x SSC/0.1% 
SDS at about room temperature (hybridization conditions); 0.2 x SSC/0.1% SDS at about 

20 room temperature (low stringency conditions); 0.2 x SSC/0.1% SDS at about 42 °C 
(moderate stringency conditions); and 0.1 x SSC at about 68 °C (high stringency 
conditions). Washing can be carried out using only one of these conditions, e.g., high 
stringency conditions, or each of the conditions can be used, e.g., for 10-1 5 minutes each, 
in the order listed above, repeating any or all of the steps listed. However, as mentioned 

25 above, optimal conditions will vary, depending on the particular hybridization reaction 
involved, and can be determined empirically. 
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DNA sequences encoding GDF-8 can be expressed in vitro by DNA transfer into a 
suitable host cell. "Host cells" are cells in which a vector can be propagated and its DNA 
expressed. The term also includes any progeny of the subject host cell. It is understood 
that all progeny may not be identical to the parental cell since there may be mutations 
5 that occur during replication. However, such progeny are included when the term "host 
cell" is used. Methods of stable transfer, meaning that the foreign DNA is continuously 
maintained in the host, are known in the art. 

In the present invention, the GDF-8 polynucleotide sequences may be inserted into a 
recombinant expression vector. The term "recombinant expression vector" refers to a 
1 0 plasmid, virus or other vehicle known in the art that has been manipulated by insertion 
or incorporation of the GDF-8 genetic sequences. Such expression vectors contain a 
promoter sequence which facilitates the efficient transcription of the inserted genetic 
sequence of the host. The expression vector typically contains an origin of replication, 
a promoter, as well as specific genes which allow phenotypic selection of the trans- 
1 5 formed cells. Vectors suitable for use in the present invention include, but are not limited 
to the T7-based expression vector for expression in bacteria (Rosenberg, et al, Gene, 
56:125, 1987), the pMSXND expression vector for expression in mammalian cells (Lee 
and Nathans, J. Biol. Chem., 263:3521, 1988) and baculovirus-derived vectors for 
expression in insect cells. The DNA segment can be present in the vector operably linked 
20 to regulatory elements, for example, a promoter (e.g., T7, metallothionein 1, or 
polyhedrin promoters). 

Polynucleotide sequences encoding GDF-8 can be expressed in either prokaryotes or 
eukaryotes. Hosts can include microbial, yeast, insect and mammalian organisms. 
Methods of expressing DNA sequences having eukaryotic or viral sequences in 
25 prokaryotes are well known in the art. Biologically functional viral and plasmid DNA 
vectors capable of expression and replication in a host are known in the art. Such vectors 
are used to incorporate DNA sequences of the invention. Preferably, the mature 
C-terminal region of GDF-8 is expressed from a cDNA clone containing the entire 
coding sequence of GDF-8. Alternatively, the C-terminal portion of GDF-8 can be 
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expressed as a fusion protein with the pro- region of another member of the TGF-P 
family or co-expressed with another pro-region (see for example, Hammonds, et al, 
Molec. Endocrin., 5:149, 1991; Gray, A., and Mason, A., Science, 247:1328, 1990). 

Transformation of a host cell with recombinant DNA may be carried out by conventional 
5 techniques as are well known to those skilled in the art. Where the host is prokaryotic, 
such as E coli, competent cells which are capable of DNA uptake can be prepared from 
cells harvested after exponential growth phase and subsequently treated by the CaCl 2 
method using procedures well known in the art. Alternatively, MgCl 2 or RbCl can be 
used. Transformation can also be performed after forming a protoplast of the host cell if 
10 desired. 

When the host is a eukaryote, such methods of transfection of DNA as calcium phosphate 
co-precipitates, conventional mechanical procedures such as microinjection, 
electroporation, insertion of a plasmid encased in liposomes, or virus vectors may be 
used. Eukaryotic cells can also be cotransformed with DNA sequences encoding the 
15 GDF-8 of the invention, and a second foreign DNA molecule encoding a selectable 
phenotype, such as the herpes simplex thymidine kinase gene. Another method is to use 
a eukaryotic viral vector, such as simian virus 40 (SV40) or bovine papilloma virus, to 
transiently infect or transform eukaryotic cells and express the protein, (see for example, 
Eukaryotic Viral Vectors, Cold Spring Harbor Laboratory, Gluzman ed., 1982). 

20 Isolation and purification of microbial expressed polypeptide, or fragments thereof, 
provided by the invention, may be carried out by conventional means including 
preparative chromatography and immunological separations involving monoclonal or 
polyclonal antibodies. 

GDF-8 Antibodies and Methods of Use 
25 The invention includes antibodies immunoreactive with GDF-8 polypeptide or functional 
fragments thereof. Antibody which consists essentially of pooled monoclonal antibodies 
with different epitopic specificities, as well as distinct monoclonal antibody preparations 
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are provided. Monoclonal antibodies are made from antigen containing fragments of the 
protein by methods well known to those skilled in the art (Kohler, et al, Nature, 
256:495, 1975). The term antibody as used in this invention is meant to include intact 
molecules as well as fragments thereof, such as Fab and V{sh\, Fv and SCA fragments 
5 which are capable of binding an epitopic determinant on GDF-8. 

(1) An Fab fragment consists of a monovalent antigen-binding fragment of an antibody 
molecule, and can be produced by digestion of a whole antibody molecule with the 
enzyme papain, to yield a fragment consisting of an intact light chain and a portion of a 
heavy chain. 

10 (2) An Fab' fragment of an antibody molecule can be obtained by treating a whole 
antibody molecule with pepsin, followed by reduction, to yield a molecule consisting of 
an intact light chain and a portion of a heavy chain. Two Fab' fragments are obtained per 
antibody molecule treated in this manner. 

(3) An (Fab') 2 fragment of an antibody can be obtained by treating a whole antibody 
1 5 molecule with the enzyme pepsin, without subsequent reduction. A (Fab') 2 fragment is 

a dimer of two Fab' fragments, held together by two disulfide bonds. 

(4) An Fv fragment is defined as a genetically engineered fragment containing the 
variable region of a light chain and the variable region of a heavy chain expressed as two 
chains. 

20 (5) A single chain antibody ("SCA") is a genetically engineered single chain molecule 
containing the variable region of a light chain and the variable region of a heavy chain, 
linked by a suitable, flexible polypeptide linker. 

As used in this invention, the term "epitope" refers to an antigenic determinant on an 
antigen, such as a GDF-8 polypeptide, to which the paratope of an antibody, such as an 
25 GDF-8-specific antibody, binds. Antigenic determinants usually consist of chemically 
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active surface groupings of molecules, such as amino acids or sugar side chains, and can 
have specific three-dimensional structural characteristics, as well as specific charge 
characteristics. 

As is mentioned above, antigens that can be used in producing GDF-8-specific antibodies 
5 include GDF-8 polypeptides or GDF-8 polypeptide fragments. The polypeptide or 
peptide used to immunize an animal can be obtained by standard recombinant, chemical 
synthetic, or purification methods. As is well known in the art, in order to increase 
immunogenicity, an antigen can be conjugated to a carrier protein. Commonly used 
carriers include keyhole limpet hemocyanin (KLH), thyroglobulin, bovine serum albumin 
10 (BSA), and tetanus toxoid. The coupled peptide is then used to immunize the animal 
(e.g., a mouse, a rat, or a rabbit). In addition to such carriers, well known adjuvants can 
be administered with the antigen to facilitate induction of a strong immune response. 

The term "cell-proliferative disorder" denotes malignant as well as non-malignant cell 
populations which often appear to differ from the surrounding tissue both morphologi- 

15 cally and genotypically. Malignant cells (i.e. cancer) develop as a result of a multistep 
process. The GDF-8 polynucleotide that is an antisense molecule or that encodes a 
dominant negative GDF-8 is useful in treating malignancies of the various organ systems, 
particularly, for example, cells in muscle, bone, kidney or adipose tissue. Essentially, any 
disorder which is etiologically linked to altered expression of GDF-8 could be considered 

20 susceptible to treatment with a GDF-8 agent (e.g., a suppressing or enhancing agent). 
One such disorder is a malignant cell proliferative disorder, for example. 

The invention provides a method for detecting a cell proliferative disorder of muscle, 
bone, kidney or adipose tissue which comprises contacting an anti-GDF-8 antibody with 
a cell suspected of having a GDF-8 associated disorder and detecting binding to the 
25 antibody. The antibody reactive with GDF-8 is labeled with a compound which allows 
detection of binding to GDF-8. For purposes of the invention, an antibody specific for 
GDF-8 polypeptide may be used to detect the level of GDF-8 in biological fluids and 
tissues. Any specimen containing a detectable amount of antigen can be used. Preferred 
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samples of this invention include muscle, bone or kidney tissue. The level of GDF-8 in 
the suspect cell can be compared with the level in a normal cell to determine whether the 
subject has a GDF-8-associated cell proliferative disorder. Such methods of detection 
are also useful using nucleic acid hybridization to detect the level of GDF-8 mRNA in 
5 a sample or to detect an altered GDF-8 gene. Preferably the subject is human. 

The antibodies of the invention can be used in any subject in which it is desirable to 
administer in vitro or in vivo immunodiagnosis or immunotherapy. The antibodies of the 
invention are suited for use, for example, in immunoassays in which they can be utilized 
in liquid phase or bound to a solid phase carrier. In addition, the antibodies in these 
10 immunoassays can be detectably labeled in various ways. Examples of types of 
immunoassays which can utilize "antibodies of the invention are competitive and 
non-competitive immunoassays in either a direct or indirect format. Examples of such 
immunoassays are the radioimmunoassay (RIA) and the sandwich (immunometric) assay. 
Detection of the antigens using the antibodies of the invention can be done utmzmg 
15 immunoassays which are run in either the forward, reverse, or simultaneous modes, 
including immunohistochemical assays on physiological samples. Those of skill in the 
art will know, or can readily discern, other immunoassay formats without undue 
experimentation. 

The antibodies of the invention can be bound to many different carriers and used to 
20 detect the presence of an antigen comprising the polypeptide of the invention. Examples 
of well-known carriers include glass, polystyrene, polypropylene, polyethylene, dextran, 
nylon, amylases, natural and modified celluloses, polyacrylamides, agaroses and 
magnetite. The nature of the carrier can be either soluble or insoluble for purposes of the 
invention. Those skilled in the art will know of other suitable carriers for binding 
25 antibodies, or will be able to ascertain such, using routine experimentation. 

There are many different labels and methods of labeling known to those of ordinary skill 
in the art. Examples of the types of labels which can be used in the present invention 
include enzymes, radioisotopes, fluorescent compounds, colloidal metals, 
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chemiluminescent compounds, phosphorescent compounds, and bioluminescent 
compounds. Those of ordinary skill in the art will know of other suitable labels for 
binding to the antibody, or will be able to ascertain such, using routine experimentation. 

Another technique which may also result in greater sensitivity consists of coupling the 
5 antibodies to low molecular weight haptens. These haptens can then be specifically 
detected by means of a second reaction. For example, it is common to use such haptens 
as biotin, which reacts with avidin, or dinitrophenyi, puridoxal, and fluorescein, which 
can react with specific antihapten antibodies. 

In using the monoclonal antibodies of the invention for the in vivo detection of antigen, 
10 the detectably labeled antibody is given a dose which is diagnostically effective. The 
term "diagnostically effective" means that the amount of detectably labeled monoclonal 
antibody is administered in sufficient quantity to enable detection of the site having the 
antigen comprising a polypeptide of the invention for which the monoclonal antibodies 
are specific. 

15 The concentration of detectably labeled monoclonal antibody which is administered 
should be sufficient such that the binding to those cells having the polypeptide is 
detectable compared to the background. Further, it is desirable that the detectably labeled 
monoclonal antibody be rapidly cleared from the circulatory system in order to give the 
best target-to-background signal ratio. 

20 As a rule, the dosage of detectably labeled monoclonal antibody for in vivo diagnosis will 
vary depending on such factors as age, sex, and extent of disease of the individual. Such 
dosages may vary, for example, depending on whether multiple injections are given, 
antigenic burden, and other factors known to those of skill in the art. 

For in vivo diagnostic imaging, the type of detection instrument available is a major 
25 factor in selecting a given radioisotope. The radioisotope chosen must have a type of 
decay which is detectable for a given type of instrument. Still another important factor 
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in selecting a radioisotope for in vivo diagnosis is that deleterious radiation with respect 
to the host is minimized. Ideally, a radioisotope used for in vivo imaging will lack a 
particle emission, but produce a large number of photons in the 140-250 keV range, 
which may readily be detected by conventional gamma cameras. 

5 For in vivo diagnosis radioisotopes may be bound to immunoglobulin either directly or 
indirectly by using an intermediate functional group, intermediate functional groups 
which often are used to bind radioisotopes which exist as metallic ions to 
immunoglobulins are the Afunctional chelating agents such as 
diethylenetriaminepentacetic acid (DTPA) and ethylenediaminetetraacetic acid (EDTA) 

10 and similar molecules. Typical examples of metallic ions which can be bound to the 
monoclonal antibodies of the invention are 1 "In, 97 Ru, 67 Ga, 68 Ga, "As^Zr and 20, T1. 

The monoclonal antibodies of the invention can also be labeled with a paramagnetic 
isotope for purposes of in vivo diagnosis, as in magnetic resonance imaging (MRI) or 
electron spin resonance (ESR). In general, any conventional method for visualizing 
15 diagnostic imaging can be utilized. Usually gamma and positron emitting radioisotopes 
are used for camera imaging and paramagnetic isotopes for MRI. Elements which are 
particularly useful in such techniques include ,57 Gd, 55 Mn, ,62 Dy, 52 Cr, and 56 Fe. 

The monoclonal antibodies of the invention can be used in vitro and in vivo to monitor 
the course of amelioration of a GDF-8-associated disease in a subject. Thus, for example, 

20 by measuring the increase or decrease in the number of cells expressing antigen 
comprising a polypeptide of the invention or changes in the concentration of such antigen 
present in various body fluids, it would be possible to determine whether a particular 
therapeutic regimen aimed at ameliorating the GDF-8-associated disease is effective. The 
term "ameliorate" denotes a lessening of the detrimental effect of the GDF-8-associated 

25 disease in the subject receiving therapy. 
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Additional Methods of Treatment and Diagnosis 

The present invention identifies a nucleotide sequence that can be expressed in an altered 
manner as compared to expression in a normal cell, therefore it is possible to design 
appropriate therapeutic or diagnostic techniques directed to this sequence. Treatment 
5 includes administration of a reagent which modulates activity. The term "modulate" 
envisions the suppression or expression of GDF-8 when it is over-expressed, or 
augmentation of GDF-8 expression when it is underexpressed. When a muscle or bone- 
associated disorder is associated with GDF-8 overexpression, such suppressive reagents 
as antisense GDF-8 polynucleotide sequence, dominant negative sequences or GDF-8 

10 binding antibody can be introduced into a cell. In addition, an anti-idiotype antibody 
which binds to a monoclonal antibody which binds GDF-8 of the invention, or an epitope 
thereof, may also be used in the therapeutic method of the invention. Alternatively, when 
a cell proliferative disorder is associated with underexpression or expression of a mutant 
GDF-8 polypeptide, a sense polynucleotide sequence (the DNA coding strand) or GDF-8 

15 polypeptide can be introduced into the cell. Such muscle or bone-associated disorders 
include cancer, muscular dystrophy, spinal cord injury, traumatic injury, congestive 
obstructive pulmonary disease (COPD), AIDS or cachecia. In addition, the method of 
the invention can be used in the treatment of obesity or of disorders related to abnormal 
proliferation of adipocytes. One of skill in the art can determine whether or not a 

20 particular therapeutic course of treatment is successful by several methods described 
herein (e.g., muscle fiber analysis or biopsy; determination of fat content). The present 
examples demonstrate that the methods of the invention are useful for decreasing fat 
content, and therefore would be useful in the treatment of obesity and related disorders 
(e.g., diabetes). Neurodegenerative disorders are also envisioned as treated by the 

25 method of the invention. 

Thus, where a cell-proliferative disorder is associated with the expression of GDF-8, 
nucleic acid sequences that interfere with GDF-8 expression at the translational level can 
be used. This approach utilizes, for example, antisense nucleic acid and ribozymes to 
block translation of a specific GDF-8 mRNA, either by masking that mRNA with an 
30 antisense nucleic acid or by cleaving it with a ribozyme. Such disorders include 



WO 99/401 o. 



PCT/US99/02S11 



26 



neurodegenerative diseases, for example. In addition, dominant-negative GDF-8 mutants 
would be useful to actively interfere with function of "normal" GDF-8. 

Antisense nucleic acids are DNA or RNA molecules that are complementary to at least 
a portion of a specific mRNA molecule (Weintraub, Scientific American, 262:40, 1 990). 
5 In the cell, the antisense nucleic acids hybridize to the corresponding mRNA, forming 
a double-stranded molecule. The antisense nucleic acids interfere with the translation of 
the mRNA, since the cell will not translate a mRNA that is double-stranded. 

Antisense oligomers of about 15 nucleotides are preferred, since they are easily 
synthesized and are less likely to cause problems than larger molecules when introduced 
1 0 into the target GDF-8-producing cell. The use of antisense methods to inhibit the in vitro 
translation of genes is well known in the art (Marcus-Sakura, Anal. Biochem., 172:289, 
1988). 

Ribozymes are RNA molecules possessing the ability to specifically cleave other 
single-stranded RNA in a manner analogous to DNA restriction endonucleases. Through 
15 the modification of nucleotide sequences which encode these RNAs, it is possible to 
engineer molecules that recognize specific nucleotide sequences in an RNA molecule and 
cleave it (Cech, J. Amer. Med. Assn., 260:3030, 1988). A major advantage of this 
approach is that, because they are sequence-specific, only mRNAs with particular 
sequences are inactivated. 

20 There are two basic types of ribozymes namely, tetrahymena-type (Hasselhoff, Nature, 
334:585, 1988) and "hammerhead"-type. Tetrahymena-type ribozymes recognize 
sequences which are four bases in length, while "hammerhead"-type ribozymes recognize 
base sequences 11-18 bases in length. The longer the recognition sequence, the greater 
the likelihood that the sequence will occur exclusively in the target mRNA species. 

25 Consequently, hammerhead-type ribozymes are preferable to tetrahymena-type 
ribozymes for inactivating a specific mRNA species and 1 8-based recognition sequences 
are preferable to shorter recognition sequences. 
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In another embodiment of the present invention, a nucleotide sequence encoding a GDF- 
8 dominant negative protein is provided. For example, a genetic construct that contain 
such a dominant negative encoding gene may be operably linked to a promoter, such as 
a tissue-specific promoter. For example, a skeletal muscle specific promoter (e.g., 
5 human skeletal muscle cc-actin promoter) or developmental ly specific promoter (e.g., 
MyHC 3, which is restricted in skeletal muscle to the embryonic period of development, 
or an inducible promoter (e.g., the orphan nuclear receptor TIS1). 

Such constructs are useful in methods of modulating a subject's skeletal mass. For 
example, a method include transforming an organism, tissue, organ or cell with a genetic 
1 0 construct encoding a dominant negative GDF-8 protein and suitable promoter in operable 
linkage and expressing the dominant negative encoding GDF-8 gene, thereby modulating 
muscle and/or bone mass by interfering with wild-type GDF-8 activity. 

GDF-8 most likely forms dimers, homodimers or heterddimers and may even form 
heterodimers with other GDF family members, such as GDF-1 1 (see Example 4). Hence, 

15 while not wanting to be bound by a particular theory, the dominant negative effect 
described herein may involve the formation of non-functional homodimers or 
heterodimers of dominant negative and wild-type GDF-8 monomers. More specifically, 
it is possible that any non-functional homodimer or any heterodimer formed by the 
dimerization of wild-type and/or dominant negative GDF-8 monomers produces a 

20 dominant effect by: 1) being synthesized but not processed or secreted; 2) inhibiting the 
secretion of wild type GDF-8; 3) preventing normal proteolytic cleavage of the 
preprotein thereby producing a nonfunctional GDF-8 molecule; 4) altering the affinity 
of the non-functional dimer (e.g., homodimeric or heterodimeric GDF-8) to a receptor 
or generating an antagonistic form of GDF-8 that binds a receptor without activating it; 

25 or 5) inhibiting the intracellular processing or secretion of GDF-8 related or TGF-B 
family proteins. 

Non- functional GDF-8 can function to inhibit the growth regulating actions of GDF-8 
on muscle and bone cells that include a dominant negative GDF-8 gene. Deletion or 
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missense dominant negative forms of GDF-8 that retain the ability to form dimers with 
wild- type GDF-8 protein but do not function as wild-type GDF-8 proteins may be used 
to inhibit the biological activity of endogenous wild- type GDF-8. For example, in one 
embodiment, the proteolytic processing site of GDF-8 may be altered (e.g., deleted) 
5 resulting in a GDF-8 molecule able to undergo subsequent dimerization with endogenous 
wild-type GDF-8 but unable to undergo further processing into a mature GDF-8 form. 
Alternatively, a non-functional GDF-8 can function as a monomelic species to inhibit 
the growth regulating actions of GDF-8 on muscle or bone cells. 

Any genetic recombinant method in the art may be used, for example, recombinant 
1 0 viruses may be engineered to express a dominant negative form of GDF-8 which may be 
used to inhibit the activity of wild-type GDF-8. Such viruses may be used therapeuti- 
cally for treatment of diseases resulting from aberrant over-expression or activity of 
GDF-8 protein, such as in denervation hypertrophy or as a means of controlling GDF-8 
expression when treating disease conditions involving the musculoskeletal system, such 
15 as in musculodegenerative diseases, osteoporosis or in tissue repair due to trauma or in 
modulating GDF-8 expression in animal husbandry (e.g., transgenic animals for 
agricultural purposes). 

The invention provides a method for treating a muscle, bone, kidney (chronic or acute) 
or adipose tissue disorder in a subject. The method includes administering a therapeuti- 

20 cally effective amount of a GDF-8 agent to the subject, thereby inhibiting abnormal 
growth of muscle, bone, kidney or adipose tissue. The GDF-8 agent may include a GDF- 
8 antisense molecule or a dominant negative polypeptide, for example. A "therapeuti- 
cally effective amount" of a GDF-8 agent is that amount that ameliorates symptoms of 
the disorder or inhibits GDF-8 induced growth of muscle or bone, for example, as 

25 compared with a normal subject. 
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Gene Therapy 

The present invention also provides gene therapy for the treatment of cell proliferative 
or immunologic disorders which are mediated by GDF-8 protein. Such therapy would 
achieve its therapeutic effect by introduction of the GDF-8 antisense or dominant 
5 negative encoding polynucleotide into cells having the proliferative disorder. Delivery 
of antisense or dominant negative GDF-8 polynucleotide can be achieved using a 
recombinant expression vector such as a chimeric virus or a colloidal dispersion system. 
Especially preferred for therapeutic delivery of antisense or dominant negative sequences 
is the use of targeted liposomes. In contrast, when it is desirable to enhance GDF-8 
10 production, a "sense" GDF-8 polynucleotide or functional equivalent {e.g., the C-term 
active region) is introduced into the appropriate cell(s). 

Various viral vectors which can be utilized for gene therapy as taught herein include 
adenovirus, herpes virus, vaccinia, or, preferably, an RNA virus such as a retrovirus. 
Preferably, the retroviral vector is a derivative of a murine or avian retrovirus. Examples 

15 of retroviral vectors in which a single foreign gene can be inserted include, but are not 
limited to: Moloney murine leukemia virus (MoMuLV), Harvey murine sarcoma virus 
(HaMuSV), murine mammary tumor virus (MuMTV), and Rous Sarcoma Virus (RSV). 
A number of additional retroviral vectors can incorporate multiple genes. All of these 
vectors can transfer or incorporate a gene for a selectable marker so that transduced cells 

20 can be identified and generated. By inserting a GDF-8 sequence of interest into the viral 
vector, along with another gene which encodes the ligand for a receptor on a specific 
target cell, for example, the vector is now target specific. Retroviral vectors can be made 
target specific by attaching, for example, a sugar, a glycolipid, or a protein. Preferred 
targeting is accomplished by using an antibody to target the retroviral vector. Those of 

25 skill in the art will know of, or can readily ascertain without undue experimentation, 
specific polynucleotide sequences which can be inserted into the retroviral genome or 
attached to a viral envelope to allow target specific delivery of the retroviral vector 
containing the GDF-8 antisense polynucleotide. 
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Since recombinant retroviruses are defective, they require assistance in order to produce 
infectious vector particles. This assistance can be provided, for example, by using helper 
cell lines that contain plasmids encoding all of the structural genes of the retrovirus under 
the control of regulatory sequences within the LTR. These plasmids are missing a 
5 nucleotide sequence which enables the packaging mechanism to recognize an RNA 
transcript for encapsulation. Helper cell lines which have deletions of the packaging 
signal include, but are not limited to i|j2, PA317 and PA 12, for example. These cell lines 
produce empty virions, since no genome is packaged. If a retroviral vector is introduced 
into such cells in which the packaging signal is intact, but the structural genes are 
10 replaced by other genes of interest, the vector can be packaged and vector virion 
produced. 

Alternatively, NIH 3T3 or other tissue culture cells can be directly transfected with 
plasmids encoding the retroviral structural genes gag, pol and env, by conventional 
calcium phosphate transfection. These cells are then transfected with the vector plasmid 
1 5 containing the genes of interest. The resulting cells release the retroviral vector into the 
culture medium. 

Another targeted delivery system for GDF-8 polynucleotides is a colloidal dispersion 
system. Colloidal dispersion systems include macromolecule complexes, nanocapsules, 
microspheres, beads, and lipid-based systems including oil-in-water emulsions, micelles, 

20 mixed micelles, and liposomes. The preferred colloidal system of this invention is a 
liposome. Liposomes are artificial membrane vesicles which are useful as delivery 
vehicles in vitro and in vivo. It has been shown that large unilamellar vesicles (LUV), 
which range in size from 0.2-4.0 |im can encapsulate a substantial percentage of an 
aqueous buffer containing large macromolecules. RNA, DNA and intact virions can be 

25 encapsulated within the aqueous interior and be delivered to cells in a biologically active 
form (Fraley, et al, Trends Biochem. ScL, 6:77, 1981). In addition to mammalian cells, 
liposomes have been used for delivery of polynucleotides in plant, yeast and bacterial 
cells, in order for a liposome to be an efficient gene transfer vehicle, the following 
characteristics should be present: (1) encapsulation of the genes of interest at high 
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efficiency while not compromising their biological activity; (2) preferential and 
substantial binding to a target cell in comparison to non-target cells; (3) delivery of the 
aqueous contents of the vesicle to the target cell cytoplasm at high efficiency; and (4) 
accurate and effective expression of genetic information (Manning, et al, Biotechniques, 
5 6:682,1988). 

The composition of the liposome is usually a combination of phospholipids, particularly 
high-phase-transition-temperature phospholipids, usually in combination with steroids, 
especially cholesterol. Other phospholipids or other lipids may also be used. The physical 
characteristics of liposomes depend on pH, ionic strength, and the presence of divalent 
1 0 cations. 

Examples of lipids useful in liposome production include phosphatidyl compounds, such 
a s phosphatidyiglycerol, phosphatidylcholine, phosphatidylserine, 

phosphatidylethanolamine, sphingolipids, cerebrosides, and gangliosides. Particularly 
useful are diacylphosphatidylglycerols, where the lipid moiety contains from 14-18 
15 carbon atoms, particularly from 16-18 carbon atoms, and is saturated. Illustrative 
phospholipids include egg phosphatidylcholine, dipalmitoylphosphatidylcholine and 
distearoylphosphatidylcholine. 

The targeting of liposomes can be classified based on anatomical and mechanistic 
factors. Anatomical classification is based on the level of selectivity, for example, 

20 organ-specific, cell-specific, and organelle-specific. Mechanistic targeting can be 
distinguished based upon whether it is passive or active. Passive targeting utilizes the 
natural tendency of liposomes to distribute to cells of the reticuloendothelial system 
(RES) in organs which contain sinusoidal capillaries. Active targeting, on the other hand, 
involves alteration of the liposome by coupling the liposome to a specific ligand such as 

25 a monoclonal antibody, sugar, glycolipid, or protein, or by changing the composition or 
size of the liposome in order to achieve targeting to organs and cell types other than the 
naturally occurring sites of localization. 
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The surface of the targeted delivery system may be modified in a variety of ways. In the 
case of a liposomal targeted delivery system, lipid groups can be incorporated into the 
lipid bilayer of the liposome in order to maintain the targeting ligand in stable association 
with the liposomal bilayer. Various linking groups can be used for joining the lipid 
5 chains to the targeting ligand. 

Due to the expression of GDF-8 in muscle, bone, kidney and adipose tissue, there are a 
variety of applications using the polypeptide, polynucleotide, and antibodies of the 
invention, related to these tissues. Such applications include treatment of cell prolifera- 
tive disorders involving these and other tissues, such as neural tissue. In addition, GDF-8 
10 may be useful in various gene therapy procedures. In embodiments where GDF-8 
polypeptide is administered to a subject, the dosage range is about 0.1 ug/kg to 100 
mg/kg; more preferably from about 1 ug/kg to 75 mg/kg and most preferably from about 
10 mg/kg to 50 mg/kg. 

Chromosomal Location o f GDF-8 

1 5 The data in Example 6 shows that the human GDF-8 gene is located on chromosome 2. 
By comparing the chromosomal location of GDF-8 with the map positions of various 
human disorders, it should be possible to determine whether mutations in the GDF-8 
gene are involved in the etiology of human diseases. For example, an autosomal 
recessive form of juvenile amyotrophic lateral sclerosis has been shown to map to 

20 chromosome 2 (Hentati, et aL, Neurology, 42 [Suppl.3]:201, 1992). More precise 
mapping of GDF-8 and analysis of DNA from these patients may indicate that GDF-8 
is, in fact, the gene affected in this disease. In addition, GDF-8 is useful for distinguish- 
ing chromosome 2 from other chromosomes. 

Transgenic Animals and Methods of Making the same 
25 Various methods to make the transgenic animals of the subject invention can be 
employed. Generally speaking, three such methods may be employed. In one such 
method, an embryo at the pronuclear stage (a "one cell embryo") is harvested from a 
female and the transgene is microinjected into the embryo, in which case the transgene 
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will be chromosomally integrated into both the germ cells and somatic cells of the 
resulting mature animal. In another such method, embryonic stem cells are isolated and 
the transgene incorporated therein by electroporation, plasmid transfection or 
microinjection, followed by reintroduction of the stem cells into the embryo where they 
5 colonize and contribute to the germ line. Methods for microinjection of mammalian 
species is described in United States Patent No. 4,873,191 . In yet another such method, 
embryonic cells are infected with a retrovirus containing the transgene whereby the germ 
cells of the embryo have the transgene chromosomally integrated therein. When the 
animals to be made transgenic are avian, because avian fertilized ova generally go 

10 through cell division for the first twenty hours in the oviduct, microinjection into the 
pronucleus of the fertilized egg is problematic due to the inaccessibility of the 
pronucleus. Therefore, of the methods to make transgenic animals described generally 
above, retrovirus infection is preferred for avian species, for example as described in U.S. 
5,162,215. If microinjection is to be used with avian species, however, a recently 

15 published procedure by Love et aL, (Biotechnology, 12, Jan 1994) can be utilized 
whereby the embryo is obtained from a sacrificed hen approximately two and one-half 
hours after the laying of the previous laid egg, the transgene is microinjected into the 
cytoplasm of the germinal disc and the embryo is cultured in a host shell until maturity. 
When the animals to be made transgenic are bovine or porcine, microinjection can be 

20 hampered by the opacity of the ova thereby making the nuclei difficult to identify by 
traditional differential interference-contrast microscopy. To overcome this problem, the 
ova can first be centrifuged to segregate the pronuclei for better visualization. 

The "non-human animals" of the invention bovine, porcine, ovine and avian animals 
(e.g., cow, pig, sheep, chicken, turkey). The "transgenic non-human animals 11 of the 

25 invention are produced by introducing "transgenes" into the germline of the non-human 
animal. Embryonal target cells at various developmental stages can be used to introduce 
transgenes. Different methods are used depending on the stage of development of the 
embryonal target cell. The zygote is the best target for micro-injection. The use of 
zygotes as a target for gene transfer has a major advantage in that in most cases the 

30 injected DNA will be incorporated into the host gene before the first cleavage (Brinster 
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et al, Proc. Natl. Acad. Sci. USA 82:4438-4442, 1985). As a consequence, all cells of the 
transgenic non-human animal will cany the incorporated transgene. This will in general 
also be reflected in the efficient transmission of the transgene to offspring of the founder 
since 50% of the germ cells will harbor the transgene. 

5 The term "transgenic" is used to describe an animal which includes exogenous genetic 
material within all of its cells. A "transgenic" animal can be produced by cross-breeding 
two chimeric animals which include exogenous genetic material within cells used in 
reproduction. Twenty-five percent of the resulting offspring will be transgenic i.e., 
animals which include the exogenous genetic material within all of their cells in both 

1 0 alleles. 50% of the resulting animals will include the exogenous genetic material within 
one allele and 25% will include no exogenous genetic material. 

In the microinjection method useful in the practice of the subject invention, the transgene 
is digested and purified free from any vector DNA e.g. by gel electrophoresis. It is 
preferred that the transgene include an operatively associated promoter which interacts 
15 with cellular proteins involved in transcription, ultimately resulting in constitutive 
expression. Promoters useful in this regard include those from cytomegalovirus (CMV), 
Moloney leukemia virus (MLV), and herpes virus, as well as those from the genes 
encoding metallothionin, skeletal actin, P-enolpyruvate carboxylase (PEPCK), 
phosphoglycerate (PGK), DHFR, and thymidine kinase. Promoters for viral long 

20 terminal repeats (LTRs) such as Rous Sarcoma Virus can also be employed. When the 
animals to be made transgenic are avian, preferred promoters include those for the 
chicken p-globin gene, chicken lysozyme gene, and avian leukosis virus. Constructs 
useful in plasmid transfection of embryonic stem cells will employ additional regulatory 
elements well known in the art such as enhancer elements to stimulate transcription, 

25 splice acceptors, termination and polyadenylation signals, and ribosome binding sites to 
permit translation. 
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Retroviral infection can also be used to introduce transgene into a non-human animal, as 
described above. The developing non-human embryo can be cultured in vitro to the 
blastocyst stage. During this time, the blastomeres can be targets for retro viral infection 
(Jaenich, R., Proc. Natl. Acad. Sci USA 73:1260-1264, 1976). Efficient infection of the 
5 blastomeres is obtained by enzymatic treatment to remove the zona pellucida (Hogan, et 
al (1986) in Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, N.Y.). The viral vector system used to introduce the transgene is 
typically a replication-defective retro virus carrying the transgene (Jahner, et al, Proc, 
Natl Acad. Sci. USA 82:6927-6931, 1985; Van der Putten, et al, Proc. Natl Acad, Sci 

10 USA 82:6148-6152, 1985). Transfection is easily and efficiently obtained by culturing 
the blastomeres on a monolayer of virus-producing cells (Van der Putten, supra; Stewart, 
et al, EMBO J. 6:383-388, 1987). Alternatively, infection can be performed at a later 
stage. Virus or virus-producing cells can be injected into the blastocoele (D. Jahner et al, 
Nature 298 :623-628, 1982). Most of the founders will be mosaic for the transgene since 

1 5 incorporation occurs only in a subset of the cells which formed the transgenic nonhuman 
animal. Further, the founder may contain various retro viral insertions of the transgene 
at different positions in the genome which generally will segregate in the offspring. In 
addition, it is also possible to introduce transgenes into the germ line, albeit with low 
efficiency, by intrauterine retroviral infection of the midgestation embryo (D. Jahner et 

20 al, supra). 

A third type of target cell for transgene introduction is the embryonal stem cell (ES). ES 
cells are obtained from pre-implantation embryos cultured in vitro and fused with 
embryos (M. J. Evans et al. Nature 292 : 1 54- 1 56, 1 98 1 ; M.O. Bradley et al, Nature 309: 
255-258, 1984; Gossler, et al, Proc, Natl Acad. Sci USA 83: 9065-9069, 1986; and 

25 Robertson et al, Nature 322 :445-448, 1986). Transgenes can be efficiently introduced 
into the ES cells by DNA transfection or by retro virus-mediated transduction. Such 
transformed ES cells can thereafter be combined with blastocysts from a nonhuman 
animal. The ES cells thereafter colonize the embryo and contribute to the germ line of 
the resulting chimeric animal. (For review see Jaenisch, R., Science 240 : 1468-1474, 

30 1988). 
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"Transformed" means a cell into which (or into an ancestor of which) has been 
introduced, by means of recombinant nucleic acid techniques, a heterologous nucleic acid 
molecule. "Heterologous" refers to a nucleic acid sequence that either originates from 
another species or is modified from either its original form or the form primarily 
5 expressed in the cell. 

"Transgene" means any piece of DNA which is inserted by artifice into a cell, and 
becomes part of the genome of the organism (i.e., either stably integrated or as a stable 
extrachromosomal element) which develops from that cell. Such a transgene may 
include a gene which is partly or entirely heterologous (i.e., foreign) to the transgenic 
10 organism, or may represent a gene homologous to an endogenous gene of the organism. 
Included within this definition is a transgene created by the providing of an RNA 
sequence which is transcribed into DNA and then incorporated into the genome. The 
transgenes of the invention include DNA sequences which encode GDF-8, and include 
GDF-sense, antisense, dominant negative encoding polynucleotides, which may be 
15 expressed in a transgenic non-human animal. The term "transgenic" as used herein 
additionally includes any organism whose genome has been altered by in vitro 
manipulation of the early embryo or fertilized egg or by any transgenic technology to 
induce a specific gene knockout. The term "gene knockout" as used herein, refers to the 
targeted disruption of a gene in vivo with complete loss of function that has been 
20 achieved by any transgenic technology familiar to those in the art. In one embodiment, 
transgenic animals having gene knockouts are those in which the target gene has been 
rendered nonfunctional by an insertion targeted to the gene to be rendered non-functional 
by homologous recombination. As used herein, the term "transgenic" includes any 
transgenic technology familiar to those in the art which can produce an organism 
25 carrying an introduced transgene or one in which an endogenous gene has been rendered 
non-functional or "knocked out." An example of a transgene used to "knockout" GDF-8 
function in the present Examples is described in Example 8 and FIGURE 12a. Thus, in 
another embodiment, the invention provides a transgene wherein the entire mature C- 
terminal region of GDF-8 is deleted. 
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The transgene to be used in the practice of the subject invention is a DNA sequence 
comprising a modified GDF-8 coding sequence. In a preferred embodiment, the GDF-8 
gene is disrupted by homologous targeting in embryonic stem cells. For example, the 
entire mature C-terminal region of the GDF-8 gene may be deleted as described in the 
5 examples below. Optionally, the GDF-8 disruption or deletion may be accompanied by 
insertion of or replacement with other DNA sequences, such as a non-functional GDF-8 
sequence. In other embodiments, the transgene comprises DNA antisense to the coding 
sequence for GDF-8. In another embodiment, the transgene comprises DNA encoding 
an antibody or receptor peptide sequence which is able to bind to GDF-8. The DNA and 

1 0 peptide sequences of GDF-8 are known in the art, the sequences, localization and activity 
disclosed in W094/21681 and pending United States patent application 08/033,923, filed 
on March 19, 1993, incorporated by reference in its entirety. The disclosure of both of 
these applications are hereby incorporated herein by reference. Where appropriate, DNA 
sequences that encode proteins having GDF-8 activity but differ in nucleic acid sequence 

1 5 due to the degeneracy of the genetic code may also be used herein, as may truncated 
forms, allelic variants and interspecies homologues. 

The invention also includes animals having heterozygous mutations in GDF-8 or partial 
inhibition of GDF-8 function or expression. A heterozygote would exhibit an 
intermediate increase in muscle and/or bone mass as compared to the homozygote as 

20 shown in Table 4 below. In other words, partial loss of function leads to a partial 
increase in muscle and bone mass. One of skill in the art would readily be able to 
determine if a particular mutation or if an antisense molecule was able to partially inhibit 
GDF-8. For example, in vitro testing may be desirable initially by comparison with wild- 
type or untreated GDF-8 (e.g., comparison of northern blots to examine a decrease in 

25 expression). 

After an embryo has been microinjected, colonized with transfected embryonic stem cells 
or infected with a retrovirus containing the transgene (except for practice of the subject 
invention in avian species which is addressed elsewhere herein) the embryo is implanted 
into the oviduct of a pseudopregnant female. The consequent progeny are tested for 



RWRnnriri' -wr^ on, 



WO 99/4016. 



PCT/US99/025II 



- 38 - 

incorporation of the transgene by Southern blot analysis of blood samples using 
transgene specific probes. PCR is particularly useful in this regard. Positive progeny 
(GO) are crossbred to produce offspring (Gl) which are analyzed for transgene 
expression by Northern blot analysis of tissue samples. To be able to distinguish 
5 expression of like-species transgenes from expression of the animals endogenous GDF-8 
gene(s), a marker gene fragment can be included in the construct in the 3' untranslated 
region of the transgene and the Northern probe designed to probe for the marker gene 
fragment. The serum levels of GDF-8 can also be measured in the transgenic animal to 
establish appropriate expression. Expression of the GDF-8 transgenes, thereby 

10 decreasing the GDF-8 in the tissue and serum levels of the transgenic animals and 
consequently increasing the muscle tissue or bone tissue content results in the foodstuffs 
from these animals (i.e. eggs, beef, pork, poultry meat, milk, etc.) having markedly 
increased muscle and/or bone content, such as ribs, and preferably without increased, and 
more preferably, reduced levels of fat and cholesterol. By practice of the subject 

15 invention, a statistically significant increase in muscle content, preferably at least a 2% 
increase in muscle content (e.g., in chickens), more preferably a 25% increase in muscle 
content as a percentage of body weight, more preferably greater than 40% increase in 
muscle content in these foodstuffs can be obtained. Similarly the subject invention may 
provide a significant increase in bone content, such as ribs, in these foodstuffs. 



20 Additional Methods of Use 

Thus, the present invention includes methods for increasing muscle and bone mass in 
domesticated animals, characterized by inactivation or deletion of the gene encoding 
growth and differentiation factor-8 (GDF-8). The domesticated animal is preferably 
selected from the group consisting of ovine, bovine, porcine, piscine and avian. The 

25 animal may be treated with an isolated polynucleotide sequence encoding growth and 
differentiation factor-8 which polynucleotide sequence is also from a domesticated 
animal selected from the group consisting of ovine, bovine, porcine, piscine and avian. 
The present invention includes methods for increasing the muscle and/or bone mass in 
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domesticated animals characterized by administering to a domesticated animal 
monoclonal antibodies directed to the GDF-8 polypeptide. The antibody may be an anti- 
GDF-8, and may be either a monoclonal antibody or a polyclonal antibody. 

The invention includes methods comprising using an anti-GDF-8 monoclonal antibody, 
5 antisense, or dominant negative mutants as a therapeutic agent to inhibit the growth 
regulating actions of GDF-8 on muscle and bone cells. Muscle and bone cells are 
defined to include fetal or adult muscle cells, as well as progenitor cells which are 
capable of differentiation into muscle or bone. The monoclonal antibody may be a 
humanized (e.g. , either fully or a chimeric) monoclonal antibody, of any species origin, 

10 such as murine, ovine, bovine, porcine or avian. Methods of producing antibody 
molecules with various combinations of "humanized" antibodies are well known in the 
art and include combining murine variable regions with human constant regions (Cabily, 
et al Proc.NatLAcad.Sci. USA, 81:3273, 1984), or by grafting the murine-antibody 
complementary determining regions (CDRs) onto the human framework (Richmann, et 

15 al t Nature 332:323, 1988). Other general references which teach methods for creating 
humanized antibodies include Morrison, et al f Science, 229 :1202, 1985; Jones, et al f 
Nature, 321 :522, 1986; Monroe, et al., Nature 312:779, 1985; Oi, et al t BioTechniques, 
4:214, 1986; European Patent Application No. 302,620; and U.S. Patent No. 5,024,834. 
Therefore, by humanizing the monoclonal antibodies of the invention for in vivo use, an 

20 immune response to the antibodies would be greatly reduced. 

The monoclonal antibody, GDF-8 polypeptide, or GDF-8 polynucleotide (all "GDF-8 
agents") may have the effect of increasing the development of skeletal muscles and 
bones, such as ribs. In preferred embodiments of the claimed methods, the GDF-8 
monoclonal antibody, polypeptide, or polynucleotide is administered to a patient 
25 suffering from a disorder selected from the group consisting of muscle wasting disease, 
neuromuscular disorder, muscle atrophy, bone degenerative diseases, osteoporosis, renal 
disease or aging. The GDF-8 agent may also be administered to a patient suffering from 
a disorder selected from the group consisting of muscular dystrophy, spinal cord injury, 
traumatic injury, congestive obstructive pulmonary disease (COPD), AIDS or cachechia. 
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In a preferred embodiment, the GDF-8 agent is administered to a patient suffereing from 
any of these diseases by intravenous, intramuscular or subcutaneous injection; preferably, 
a monoclonal antibody is administered within a dose range between about 0.1 mg/kg to 
about 100 mg/kg; more preferably between about 1 ug/kg to 75 mg/kg; most preferably 
5 from about 1 0 mg/kg to 50 mg/kg. The antibody may be administered, for example, by 
bolus injunction or by slow infusion. Slow infusion over a period of 30 minutes to 2 
hours is preferred. The GDF-8 agent may be formulated in a formulation suitable for 
administration to a patient. Such formulations are known in the art. 

The dosage regimen will be determined by the attending physician considering various 
10 factors which modify the action of the GDF-8 protein, e.g. amount of tissue desired to 
be formed, the site of tissue damage, the condition of the damaged tissue, the size of a 
wound, type of damaged tissue, the patient's age, sex, and diet, the severity of any 
infection, time of administration and other clinical factors. The dosage may vary with 
the type of matrix used in the reconstitution and the types of agent, such as anti-GDF-8 
15 antibodies, to be used in the composition. Generally, systemic or injectable administra- 
tion, such as intravenous (TV), intramuscular (IM) or subcutaneous (Sub-Q) injection. 
Adrninistration will generally be initiated at a dose which is minimally effective, and the 
dose will be increased over a preselected time course until a positive effect is observed. 
Subsequently, incremental increases in dosage will be made limiting such incremental 
20 increases to such levels that produce a corresponding increase in effect, while taking into 
account any adverse affects that may appear. The addition of other known growth 
factors, such as IGF I (insulin like growth factor I), human, bovine, or chicken growth 
hormone which may aid in increasing muscle and bone mass, to the final composition, 
may also affect the dosage. In the embodiment where an anti-GDF-8 antibody is 
25 administered, the anti-GDF-8 antibody is generally administered within a dose range of 
about 0.1 ug/kg to about 100 mg/kg.; more preferably between about 10 mg/kg to 50 
mg/kg. 
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Progress can be monitored by periodic assessment of tissue growth and/or repair. The 
progress can be monitored, for example, x-rays, histomorphometric determinations and 
tetracycline labeling. 

Screening for GDF-8 Modulating Compounds 
5 In another embodiment, the invention provides a method for identifying a compound or 
molecule that modulates GDF-8 protein activity or gene expression. The method 
includes incubating components comprising the compound, GDF-8 polypeptide or with 
a recombinant cell expressing GDF-8 polypeptide, under conditions sufficient to allow 
the components to interact and determining the effect of the compound on GDF-8 

10 activity or expression. The effect of the compound on GDF-8 activity can be measured 
by a number of assays, and may include measurements before and after incubating in the 
presence of the compound. Compounds that affect GDF-8 activity or gene expression 
include peptides, peptidomimetics, polypeptides, chemical compounds and biologic 
agents. Assays include Northern blot analysis of GDF-8 mRNA (for gene expression), 

1 5 Western blot analysis (for protein level) and muscle fiber analysis (for protein activity). 

The above screening assays may be used for detecting the compounds or molecules that 
bind to the GDF-8 receptor or GDF-8 polypeptide, in isolating molecules that bind to the 
GDF-8 gene, for measuring the amount of GDF-8 in a sample, either polypeptide or RNA 
(mRNA), for identifying molecules that may act as agonists or antagonists, and the like. 
20 For example, GDF-8 antagonists are useful for treatment of muscular and adipose tissue 
disorders (e.g., obesity). 

Incubating includes conditions which allow contact between the test compound and 
GDF-8 polypeptide or with a recombinant cell expressing GDF-8 polypeptide. 
Contacting includes in solution and in solid phase, or in a cell. The test compound may 
25 optionally be a combinatorial library for screening a plurality of compounds. Com- 
pounds identified in the method of the invention can be further evaluated, detected, 
clofted, sequenced, and the like, either in solution or after binding to a solid support, by 
any method usually applied to the detection of a specific DNA sequence such as PCR, 
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oligomer restriction (Saiki, et al, Bio/Technology, 3:1008-1012, 1985), allele-specific 
oligonucleotide (ASO) probe analysis (Conner, et al, Proc. Natl Acad, Sci. USA, 
80:278, 1983), oligonucleotide Landegren, et al, Science, 241:1077, 1988), and the like. 
Molecular techniques for DNA analysis have been reviewed (Landegren, et al. s Science, 
5 242:229-237, 1988). 

All references cited herein are hereby incorporated by reference in their entirety. 

The following examples are intended to illustrate but not limit the invention. While they 
are typical of those that might be used, other procedures known to those skilled in the art 
may alternatively be used. 

10 EXAMPLE 1 

IDENTIFICATION AND ISOLATION OF A NOVEL 
TGF-p FAMILY MEMBER 

To identify a new member of the TGF-P superfamily, degenerate oligonucleotides were 
designed which corresponded to two conserved regions among the known family 

15 members: one region spanning the two tryptophan residues conserved in all family 
members except MIS and the other region spanning the invariant cysteine residues near 
the C -terminus. These primers were used for polymerase chain reactions on mouse 
genomic DNA followed by subcloning the PCR products using restriction sites placed 
at the 5 1 ends of the primers, picking individual E. coli colonies carrying these subcloned 

20 inserts, and using a combination of random sequencing and hybridization analysis to 
eliminate known members of the superfamily. 

GDF-8 was identified from a mixture of PCR products obtained with the primers 
SJL141 : 5 , -CCGGAATTCGGITGG(G/C/A)A(G/A/T/C)(A/G)A(T/C)TGG(A/G)TI 
(A/G)TI(T/G)CICC-3' (SEQ ID NO:l) 

25 SJL147: 

5 , -CCGGAATTC(G/A)CAI(G/C)C(G/A)CA(G/A)CT(GLVT/C) 
TCIACI(G/A)(T/C)CAT-3' (SEQ ID NO:2) 
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PCR using these primers was carried out with 2 (ig mouse genomic DNA at 94 °C for 1 
min, 50 °C for 2 min, and 72 °C for 2 min for 40 cycles. 

PCR products of approximately 280 bp were gel-purified, digested with Eco Rl, 
gel-purified again, and subcloned in the Bluescript vector (Stratagene, San Diego, CA). 
5 Bacterial colonies carrying individual subclones were picked into 96 well microtiter 
plates, and multiple replicas were prepared by plating the cells onto nitrocellulose. The 
replicate filters were hybridized to probes representing known members of the family, 
and DNA was prepared from nonhybridizing colonies for sequence analysis. 

The primer combination of SJL141 and SJL147, encoding the amino acid sequences 
10 GW(1^Q/N/K/D/E)(D/N)W(V/1A4)(V/I/M)(A/S)P (SEQ ID NO:9) and 
M(V/I/MyT/A)V(D/E)SC(G/A)C (SEQ ID NO: 10), respectively, yielded four previously 
identified sequences (BMP-4, inhibin,pB, GDF-3 and GDF-5) and one novel sequence, 
which was designated GDF-8, among 1 10 subclones analyzed. 

Human GDF-8 was isolated using the primers: 

1 5 ACM13: 5-CGCGGATCCAGAGTCAAGGTGACAGACACAC-3 1 (SEQ ID NO:3); and 
ACM14: 5-CGCGGATCCTCCTCATGAGCACCCACAGCGGTC-3 1 (SEQ IDNO:4) 

PCR using these primers was carried out with one jig human genomic DNA at 94 °C for 
1 min, 58 °C for 2 min, and 72 °C for 2 min for 30 cycles. The PCR product was digested 
with Bam HI, gel-purified, and subcloned in the Bluescript vector (Stratagene, San 
20 Francisco, CA). 

EXAMPLE 2 

EXPRESSION PATTERN AND SEQUENCE OF GDF-8 
To determine the expression pattern of GDF-8, RNA samples prepared from a variety of 
adult tissues were screened by Northern analysis. RNA isolation and Northern analysis 
25 were carried out as described previously (Lee, S.J., MoL Endocrinol, 4:1034, 1990) 
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except that hybridization was carried out in 5X SSPE, 10% dextran sulfate, 50% 
formamide, 1% SDS, 200 ng/ml salmon DNA, and 0.1% each of bovine serum albumin, 
ficoll, and polyvinylpyrrolidone. Five micrograms of twice poly A-selected RNA 
prepared from each tissue (except for muscle, for which only 2 jag RNA was used) were 
5 electrophoresed on formaldehyde gels, blotted, and probed with GDF-8. As shown in 
FIGURE 1, the GDF-8 probe detected a single mRNA species expressed at highest levels 
in muscle and at significantly lower levels in adipose tissue. 

To obtain a larger segment of the GDF-8 gene, a mouse genomic library was screened 
with a probe derived from the GDF-8 PCR product. The partial sequence of a GDF-8 

10 genomic clone is shown in FIGURE 2a. The sequence contains an open reading frame 
corresponding to the predicted C-terminal region of the GDF-8 precursor protein. The 
predicted GDF-8 sequence contains two potential proteolytic processing sites, which are 
boxed. Cleavage of the precursor at the second of these sites would generate a mature C 
terminal fragment 109 amino acids in length with a predicted molecular weight of 

15 12,400. The partial sequence of human GDF-8 is shown in FIGURE 2b. Assuming no 
PCR-induced errors during the isolation of the human clone, the human and mouse amino 
acid sequences in this region are 100% identical. 

The C-terminal region of GDF-8 following the putative proteolytic processing site shows 
significant homology to the known members of the TGF-P; superfamily (FIGURE 3). 

20 FIGURE 3 shows the alignment of the C-terminal sequences of GDF-8 with the 
corresponding regions of human GDF-1 (Lee, Proa Natl Acad, Set USA, 88:4250-4254, 
1991), human BMP-2 and 4 (Wozney, et al t Science, 242:1528-1534, 1988), human 
Vgr-1 (Celeste, et al Proc. Natl Acad, ScL USA, 87:9843-9847, 1990), human OP-1 
(Ozkaynak, et aU EMBOJ., 9:2085-2093, 1990), human BMP-5 (Celeste, et al, Proc. 

25 Natl Acad. ScL USA, 87:9843-9847, 1990), human BMP-3 (Wozney, et aL Science, 
242:1528-1534, 1988), human MiS (Cate, et al Cell, 45:685-698,1986), human inhibin 
alpha, pA, and pB (Mason, etal t Biochem, Biophys. Res. Commun., 135:957-964, 1986), 
human TGF-Pl Perynck, et al, Nature, 316:701 -705, 1985), humanTGF-R2 (deMartin, 
et al, EMBO J., 6:3673-3677, 1987), and human TGF-P3 (ten Dijke, et al, Proc. Natl 
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Acad. Sci. USA, 85*4715-4719, 1988). The conserved cysteine residues are boxed. 
Dashes denote gaps introduced in order to maximize the alignment. 

GDF-8 contains most of the residues that are highly conserved in other family members, 
including the seven cysteine residues with their characteristic spacing. Like the TGF-ps 
5 and inhibin ps, GDF-8 also contains two additional cysteine residues. In the case of 
TGF-p2, these two additional cysteine residues are known to form an intramolecular 
disulfide bond (Daopin, et al, Science, 257:369, 1992; Schlunegger and Grutter, Nature, 
358:430. 1992). 

FIGURE 4 shows the amino acid homologies among the different members of the TGF-p 
10 superfamily. Numbers represent percent amino acid identities between each pair 
calculated from the first conserved cysteine to the C terminus. Boxes represent 
homologies among highly-related members within particular subgroups. In this region, 
GDF-8 is most homologous to Vgr-1 (45% sequence identity). 

EXAMPLE 3 

15 ISOLATION OF cDNA CLONES ENCODING MURINE AND HUMAN GPF-8 

In order to isolate full-length cDNA clones encoding murine and human GDF-8, cDNA 
libraries were prepared in the lambda ZAP II vector (Stratagene) using RNA prepared 
from skeletal muscle. From 5 \i% of twice poly A-selected RNA prepared from murine 
and human muscle, cDNA libraries consisting of 4.4 million and 1 .9 million recombinant 
20 phage, respectively, were constructed according to the instructions provided by 
Stratagene. These libraries were screened without amplification. Library screening and 
characterization of cDNA inserts were carried out as described previously (Lee, Mol 
Endocrinol, 4: 1034-1040). 

From 2.4 x 10 6 recombinant phage screened from the murine muscle cDNA library, 
25 greater than 280 positive phage were identified using a murine GDF-8 probe derived 
from a genomic clone, as described in Example 1. The entire nucleotide sequence of the 
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longest cDNA insert analyzed is shown in FIGURE 5a and 5b and SEQ ID NO: 1 1 . The 
2676 base pair sequence contains a single long open reading frame beginning with a 
methionine codon at nucleotide 104 and extending to a TGA stop codon at nucleotide 
1232. Upstream of the putative initiating methionine codon is an in-frame stop codon at 

5 nucleotide 23. The predicted pre-pro-GDF-8 protein is 76 amino acids in length. The 
sequence contains a core of hydrophobic amino acids at the N-terminus suggestive of a 
signal peptide for secretion (FIGURE 6a), one potential N-glycosylation site at 
asparagine 72, a putative RXXR proteolytic cleavage site at amino acids 264-267, and 
a C-terminal region showing significant homology to the known members of the TGF-P 

1 0 superfamily. Cleavage of the precursor protein at the putative RXXR site would generate 
a mature C-terminal GDF-8 fragment 109 amino acids in length with a predicted 
molecular weight of approximately 12,400. 

From 1.9 x 10 6 recombinant phage screened from the human muscle cDNA library, 4 
positive phage were identified using a human GDF-8 probe derived by polymerase chain 

1 5 reaction on human genomic DNA. The entire nucleotide sequence of the longest cDNA 
insert is shown in FIGURE 5c and 5d and SEQ ID NO:13. The 2743 base pair sequence 
contains a single long open reading frame beginning with a methionine codon at 
nucleotide 59 and extending to a TGA stop codon at nucleotide 1 184. The predicted 
pre-pro-GDF-8 protein is 375 amino acids in length. The sequence contains a core of 

20 hydrophobic amino acids at the N-terminus suggestive of a signal peptide for secretion 
(FIGURE 6b), one potential N-glycosylation site at asparagine 71, and a putative RX)(R 
proteolytic cleavage site at amino acids 263-266. FIGURE 7 shows a comparison of the 
predicted murine (top) and human (bottom) GDF-8 amino acid sequences. Numbers 
indicate amino acid position relative to the N-terminus. Identities between the two 

25 sequences are denoted by a vertical line. Murine and human GDF-8 are approximately 
94% identical in the predicted pro-regions and 100% identical following the predicted 
RXXR cleavage sites. 
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EXAMPLE 4 
PIMERIZATION OF GDF-8 

To determine whether the processing signals in the GDF-8 sequence are functional and 
whether GDF-8 forms dimers like other members of the TGF-B superfamily, the GDF-8 
5 cDNA was stably expressed in CHO cells. The GDF-8 coding sequence was cloned into 
the pMSXND expression vector (Lee and Nathans, J. Biol Chem., 263:3521,(1988) and 
transfected into CHO cells. Following G41 8 selection, the cells were selected in 0.2 
methotrexate, and conditioned medium from resistant cells was concentrated and 
electrophoresed on SDS gels. Conditioned medium was prepared by Cell Trends, Inc. 

10 (Middletown, MD). For preparation of anti-GDF-8 serum, the C-terminal region of 
GDF-8 (amino acids 268 to 376) was expressed in bacteria using the RSET vector 
(Invitrogen, San Diego, CA), purified using a nickle chelate column, and injected into 
rabbits. All immunizations were carried out by Spring Valley Labs (Woodbine, MD). 
Western analysis using [ 125 I]iodoprotein A was carried out as described (Burnette, W.N., 

15 Anal Biochem.* 1 12 :195, 1981). Western analysis of conditioned medium prepared from 
these cells using an antiserum raised against a bacterially-expressed C-terminal fragment 
of GDF-8 detected two protein species with apparent molecular weights of approximately 
52K and 1 5K under reducing conditions, consistent with unprocessed and processed 
forms of GDF-8, respectively. No bands were obtained either with preimmune serum or 

20 with conditioned medium from CHO cells transfected with an antisense construct. Under 
non-reducing conditions, the GDF-8 antiserum detected two predominant protein species 
with apparent molecular weights of approximately 101K and 25K, consistent with 
dimeric forms of unprocessed and processed GDF-8, respectively. Hence, like other 
TGF-B family members, GDF-8 appears to be secreted and proteolytically processed, and 

25 the C-terminal region appears to be capable of forming a disulfide-linked dimer. 
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EXAMPLE 5 

PREPARATION OF ANTIBODIES AGAINST GDF-8 AND 
EXPRESSION OF GDF-8 IN MAMMALIAN CELLS 

In order to prepare antibodies against GDF-8, GDF-8 antigen was expressed as a fusion 
5 protein in bacteria. A portion of murine GDF-8 cDNA spanning amino acids 268-376 
(mature region) was inserted into the pRSET vector (Invitrogen) such that the GDF-8 
coding sequence was placed in frame with the initiating methionine codon present in the 
vector; the resulting construct created an open reading frame encoding a fusion protein 
with a molecular weight of approximately 16,600. The fusion construct was transformed 
10 into BL21 (DE3) (pLysS) cells, and expression of the fusion protein was induced by 
treatment with isopropylthio-P-galactoside as described (Rosenberg, et ai, Gene, 
56:125-135). The fusion protein was then purified by metal chelate chromatography 
according to the instructions provided by Invitrogen. A Coomassie blue-stained gel of 
unpurified and purified fusion proteins is shown in FIGURE 8. 

15 The purified fusion protein was used to immunize both rabbits and chickens. Immuniza- 
tion of rabbits was carried out by Spring Valley Labs (Sykesville, MD), and immuniza- 
tion of chickens was carried out by HRP, Inc. (Denver, PA). Western analysis of sera 
both from immunized rabbits and from immunized chickens demonstrated the presence 
of antibodies directed against the fusion protein. 

20 To express GDF-8 in mammalian cells, the murine GDF-8 cDNA sequence from 
nucleotides 48-1303 was cloned in both orientations downstream of the metallothionein 
I promoter in the pMSXND expression vector; this vector contains processing signals 
derived from SV40, a dihydrofolate reductase gene, and a gene conferring resistance to 
the antibiotic G418 (Lee and Nathans, J. Biol Chem., 263:3521-3527). The resulting 

25 constructs were transfected into Chinese hamster ovary cells, and stable tranfectants were 
selected in the presence of G41 8. Two milliliters of conditioned media prepared from the 
G4 18 -resistant cells were dialyzed, lyophilized, electrophoresed under denaturing, 
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reducing conditions, transferred to nitrocellulose, and incubated with anti-GDF-8 
antibodies (described above) and [ l25 l]iodoproteinA. 

As shown in FIGURE 9, the rabbit GDF-8 antibodies (at a 1:500 dilution) detected a 
protein of approximately the predicted molecular weight for the mature C-terminal 
5 fragment of GDF-8 in the conditioned media of cells transfected with a construct in 
which GDF-8 had been cloned in the correct (sense) orientation with respect to the 
metal lothionein promoter (lane 2); this band was not detected in a similar sample 
prepared from cells transfected with a control antisense construct (lane 1 ). Similar results 
were obtained using antibodies prepared in chickens. Hence, GDF-8 is secreted and 
10 proteolytically processed by these transfected mammalian cells. 

EXAMPLE 6 
EXPRESSION PATTERN OF GDF-8 

To determine the pattern of GDF-8, 5 jig of twice poly A-selected RNA prepared from 
a variety of murine tissue sources were subjected to Northern analysis. As shown in 

1 5 FIGURE 1 0a (and as shown previously in Example 2), the GDF-8 probe detected a single 
mRNA species present almost exclusively in skeletal muscle among a large number of 
adult tissues surveyed. On longer exposures of the same blot, significantly lower but 
detectable levels of GDF-8 mRNA were seen in fat, brain, thymus, heart, and lung. 
Hence, these results confirm the high degree of specificity of GDF-8 expression in 

20 skeletal muscle. GDF-8 mRNA was also detected in mouse embryos at both gestational 
ages (day 12.5 and day 18.5 post-coital) examined but not in placentas at various stages 
of development (FIGURE 10b). 

To further analyze the expression pattern of GDF-8, in situ hybridization was performed 
on mouse embryos isolated at various stages of development. 
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For all in situ hybridization experiments, probes corresponding to the C-terminal region 
of GDF-8 were excluded in order to avoid possible cross-reactivity with other members 
of the superfamily. Whole mount in situ hybridization analysis was carried out as 
described (Wilkinson, D.G., In Situ Hybridization, A Practical Approach, pp. 75-83, 
5 IRL. Press, Oxford, 1992) except that blocking and antibody incubation steps were 
carried out as in Knecht et al (Knecht, et al, Development, 121:1927, 1955). Alkaline 
phosphatase reactions were carried out for 3 hours for day 10.5 embryos and overnight 
for day 9.5 embryos. Hybridization was carried out using digoxigenin-labelled probes 
spanning nucleotides 8-811 and 1298-2676, which correspond to the pro-region and 3' 

10 untranslated regions, respectively. In situ hybridization to sections was carried out as 
described (Wilkinson, et al, Cell, 50:79, 1987) using 35 S-labelled probes ranging from 
approximately 100-650 bases in length and spanning nucleotides 8-793 and 1566-2595. 
Following hybridization and washing, slides were dipped in NTB-3 photographic 
emulsion, exposed for 16-19 days, developed and stained with either hematoxylin and 

15 eosin or toluidine blue. RNA isolation, poly A selection, and Northern analysis were 
carried out as described previously (McPherron and Lee, J, Biol Chem., 268 :3444. 
1993). 

At all stages examined, the expression of GDF-8 mRNA appeared to be restricted to 
developing skeletal muscle. At early stages, GDF-8 expression was restricted to 

20 developing somites. By whole mount in situ hybridization analysis, GDF-8 mRNA could 
first be detected as early as day 9.5 post coitum in approximately one-third of the 
somites. At this stage of development, hybridization appeared to be restricted to the most 
mature (9 out of 21 in this example), rostral somites. By day 10.5 p.c., GDF-8 
expression was clearly evident in almost every somite (28 out of 33 in this example 

25 shown). Based on in situ hybridization analysis of sections prepared from day 10.5 p.c. 
embryos, the expression of GDF-8 in somites appeared to be localized to the myotome 
compartment. At later stages of development, GDF-8 expression was detected in a wide 
range of developing muscles. 
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GDF-8 continues to be expressed in adult animals as well. By Northern analysis, GDF-8 
mRNA expression was seen almost exclusively in skeletal muscle among the different 
adult tissues examined. A significantly lower though clearly detectable signal was also 
seen in adipose tissue. Based on Northern analysis of RNA prepared from a large 
5 number of different adult skeletal muscles, GDF-8 expression appeared to be widespread 
although the expression levels varied among individual muscles. 

EXAMPLE 7 
CHROMOSOMAL LOCALIZATION OF GDF-8 

In order to map the chromosomal location of GDF-8, DNA samples from human/rodent 
1 0 somatic cell hybrids (Drwinga, et al., Genomics, 16:3 11-413,1 993; Dubois and Naylor, 
Genomics, 16:3 15-319, 1 993) were analyzed by polymerase chain reaction followed by 
Southern blotting. Polymerase chain reaction was carried out using primer #83, 5-C- 
GCGGATCCGTGGATCTAAATG AGAAC AGTGAGC-3 1 (SEQ ID NO: 1 5) and primer 
#84, S-CGCGAATTCTCAGGTAATGATTGTTTCCGTTGTAGCG-SXSEQ ID NO:16) 
15 for 40 cycles at 94°C for 2 minutes, 60°C for 1 minute, and 72°C for 2 minutes. These 
primers correspond to nucleotides 119 to 143 (flanked by a Bam HI recognition 
sequence), and nucleotides 394 to 418 (flanked by an Eco Rl recognition sequence), 
respectively, in the human GDF-8 cDNA sequence. PCR products were electrophoresed 
on agarose gels, blotted, and probed with oligonucleotide #100, 
20 5 - AC ACT AA ATCTTC AAG AAT A-3 1 (SEQ ID NO:17), which corresponds to a 
sequence internal to the region flanked by primer #83 and #84. Filters were hybridized 
in 6 X SSC, 1 X Denhardt's solution, 100|ag/ml yeast transfer RNA, and 0.05% sodium 
pyrophosphate at 50 °C. 

As shown in FIGURE 1 1, the human-specific probe detected a band of the predicted size 
25 (approximately 320 base pairs) in the positive control sample (total human genomic 
DNA) and in a single DNA sample from the human/rodent hybrid panel. This positive 
signal corresponds to human chromosome 2. The human chromosome contained in each 
of the hybrid cell lines is identified at the top of each of the first 24 lanes (1-22, X, and 
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Y). In the lanes designated M, CHO, and H, the starting DNA template was total 
genomic DNA from mouse, hamster, and human sources, respectively. In the lane 
marked Bl , no template DNA was used. Numbers at left indicate the mobilities of DNA 
standards. These data show that the human GDF-8 gene is located on chromosome 2. 

5 EXAMPLE 8 

GDF-8 TRANSGENIC KNOCKOUT MICE 

The GDF-8, we disrupted the GDF-8 gene was disrupted by homologous targeting in 
embryonic stem cells. To ensure that the resulting mice would be null for GDF-8 
function, the entire mature C-terminal region was deleted and replaced by a neo cassette 
10 (Figure 12a). A murine 129 SV/J genomic library was prepared in lambda FIX II 
according to the instructions provided by Stratagene (La Jolla, CA). The structure of the 
GDF-8 gene was deduced from restriction mapping and partial sequencing of phage 
clones isolated from this library. Vectors for preparing the targeting construct were 
kindly provided by Philip Soriano and Kirk Thomas University. Rl ES cells were trans- 
15 fected with the targeting construct, selected with gancyclovir (2 ^iM) and G418 (250 
yug/ml), and analyzed by Southern analysis. Homologously targeted clones were injected 
into C57BL/6 blastocysts and transferred into pseudopregnant females. Germhne 
transmission of the targeted allele was obtained in a total of 9 male chimeras from 5 
independently-derived ES clones. Genomic Southern blots were hybridized at 42°C as 
20 described above and washed in 0.2X SSC, 0.1% SDS at 42°C. 

For whole leg analysis, legs of 14 week did mice were skinned, treated with 0.2 M EDTA 
in PBS at 4°C for 4 weeks followed by 0.5 M sucrose in PBS at 4°C. For fiber number 
and size analysis, samples were directly mounted and frozen in isopentane as described 
(Brumback and Leech, Color Atlas of Muscle Histochemistry, pp. 9-33, PSG Publishing 
25 Company, Littleton, MA, 1 984). Ten to 30 A*m sections were prepared using a cryostat 
and stained with hematoxylin and eosin. Muscle fiber numbers were determined from 
sections taken from the widest part of the tibialis cranialis muscle. Muscle fiber sizes 
were measured from photographs of sections of tibialis cranialis and gastrocnemius 
muscles. Fiber type analysis was carried out using the mysosin ATPase assay after 
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pretreatment at pH 4.35 as described (dimming, et al t Color Atlas of Muscle Pathology, 
pp. 184-185, 1994) and by immunohistochemistry using an antibody directed against 
type I myosin (MY32, Sigma) and the Vectastain method (Vector Labs); in the 
immunohistochemical experiments, no staining was seen when the primary antibodies 
5 were left out. Carcasses were prepared from shaved mice by removing the all of the 
internal organs and associated fat and connective tissue. Fat content of carcasses from 
4 month old males was determined as described (Leshner, et al f Physiol Behavior, 
9:281, 1972). 

For protein and DNA analysis, tissue was homogenized in 150 mM NaCl, 100 mM 
10 EDTA. Protein concentrations were determined using the Biorad protein assay. DNA 
was isolated by adding SDS to 1%, treating with 1 mg/ml proteinase K overnight at 
55°C, extracting 3 times with phenol and twice with chloroform, and precipitating with 
ammonium acetate and EtOH. DNA was digested with 2 mg/ml RNase for 1 hour at 
37°C, and following proteinase K digestion and phenol and chloroform extractions, the 
15 DNA was precipitated twice with ammonium acetate and EtOH. 

Homologous targeting of the GDF-8 gene was seen in 13/131 gancyclovir/G418 
doubly-resistant ES cell clones. Following injection of these targeted clones into 
blastocysts, we obtained chimeras from 5 independently-derived ES clones that produced 
heterozygous pups when crossed to C57BL/6 females (Figure 12b). Genotypic analysis 
20 of 678 offspring derived from crosses of Fl heterozygotes showed 170 +/+ (25%), 380 
+/- (56%), and 128 -/- (19%). Although the ratio of genotypes was close to the expected 
ratio of 1 :2: 1 , the smaller than expected number of homozygous mutants appeared to be 
statistically significant (pO.OOl). 

Homozygous mutants were viable and fertile when crossed to C57BL/6 mice and to each 
25 other. Homozygous mutant animals, however, were approximately 30% larger than their 
heterozygous and wild type littermates (Table 1). The difference between mutant and 
wild type body weights appeared to be relatively constant irrespective of age and sex in 
adult animals. Adult mutants also displayed an abnormal body shape, with pronounced 
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shoulders and hips. When the skin was removed from animals that had been sacrificed, 
it was apparent that the muscles of the mutants were much larger than those of wild type 
animals. The increase in skeletal muscle mass appeared to be widespread throughout the 
body. Individual muscles isolated from homozygous mutant animals weighed 
5 approximately 2-3 times more than those isolated from wild type littermates (Table 2). 
Although the magnitude of the weight increase appeared to roughly correlate with the 
level of GDF-8 expression in the muscles examined. To determine whether the increased 
muscle mass could account for the entire difference in total body weights between wild 
type and mutant animals or whether many tissues were generally larger in the mutants, 
10 we compared the total body weights to carcass weights. As shown in Table 3, the 
difference in carcass weights between wild type and mutant animals was comparable to 
the difference in total body weights. Moreover, because the fat content of mutant and 
wild type animals was similar, these data are consistent with all of the total body weight 
difference resulting from an increase in skeletal muscle mass, although we have not 
1 5 formally ruled out the possibility that differences in bone mass might also contribute to 
the differences in total body mass. 

To determine whether the increase in skeletal muscle mass resulted from hyperplasia or 
from hypertrophy, histologic analysis of several different muscle groups was performed. 
The mutant muscle appeared grossly normal. No excess connective tissue or fat was seen 
20 nor were there any obvious signs of degeneration, such as widely varying fiber sizes (see 
below) or centrally-placed nuclei. Quantitation of the number of muscle fibers showed 
that at the widest portion of the tibialis cranialis muscle, the total cell number was 86% 
higher in mutant animals compared to wild type littermates [mutant = 5470 +/- 121 (n 
= 3), wild type - 2936 +/- 288 (n = 3); p < 0.01]. Consistent with this result was the 
25 finding that the amount of DN A extracted from mutant muscle was roughly 50% higher 
than from wild type muscle [mutant = 350 Mg (n = 4), wild type = 233 /zg (n = 3) from 
pooled gastrocnemius, plantaris, triceps brachii, tibialis cranialis, and pectoralis muscles; 
p = 0.05]. Hence, a large part of the increase in skeletal muscle mass resulted from 
muscle cell hyperplasia. However, muscle fiber hypertrophy also appeared to contribute 
30 to the overall increase in muscle mass. As shown in Figure 1 3, the mean fiber diameter 
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of the tibialis cranialis muscle and gastrocnemius muscle was 7% and 22% larger, 
respectively, in mutant animals compared to wild type littermates, suggesting that the 
cross-sectional area of the fibers was increased by approximately 14% and 49%, 
respectively. Notably, although the mean fiber diameter was larger in the mutants, the 
5 standard deviation in fiber sizes was similar between mutant and wild type muscle, 
consistent with the absence of muscle degeneration in mutant animals. The increase in 
fiber size was also consistent with the finding that the protein to DNA ratio (w/w) was 
slightly increased in mutant compared to wild type muscle [mutant = 871 +/- 1 1 1 (n = 
4), wild type = 624 +/- 85 (n = 3); p < 0.05]. 

10 Table 4 shows a comparison between muscle weight (in grams) from wild-type (+/+), 
hetero2yous (+/-) and a homozygous knock-out mice (-/-). The muscle mass is increased 
in heterozyogous as compared to wild-type animals. 

Finally, fiber type analysis of various muscles was carried out to determine whether the 
number of both type I (slow) and type II (fast) fibers was increased in the mutant 

15 animals. In most of the muscles examined, including the tibialis cranialis muscle, the 
vast majority of muscle fibers were type II in both mutant and wild type animals. Hence, 
based on the cell counts discussed above, the absolute number of type II fibers were 
increased in the tibialis cranialis muscle. In the soleus muscle, where the number of type 
I fibers was sufficiently high that we could attempt to quantitate the ratio of fiber types 

20 could be quantiated, the percent of type I fibers was decreased by approximately 33% in 
mutant compared to wild type muscle [wild type = 39.2 +/- 8.1 (n = 3), mutant = 26.4 +/- 
9.3 (n = 4)]; however, the variability in this ratio for both wild type and mutant animals 
was too high to support any firm conclusions regarding the relative number of fiber 
types. 
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EXAMPLE 9 
TSOI ATION OF PAT AND CH ICKEN GDF-8 

In order to isolate rat and chicken GDF-8 cDNA clones, skeletal muscle cDNA libraries 
prepared from these species were obtained from Stratagene and screened with a murine 
5 GDF-8 probe. Library screening was carried out as described previously (Lee, Mol. 
Endocrinol., 4:1034-1040) except that final washes were carried out in 2 X SSC at 65 °C. 
Partial sequence analysis of hybridizing clones revealed the presence of open reading 
frames highly related to murine and human GDF-8. Partial sequences of rat and chicken 
GDF-8 are shown in Figures 2c and 2d, respectively, and an alignment of the predicated 
10 rat and chicken GDF-8 amino acid sequences with those of murine and human GDF-8 
are shown in Figure 3b. Full length rat and chicken GDF-8 is shown in Figures 14d and 
14c, respectively and sequence alignment between murine, rat, human, baboon, porcine, 
ovine, bovine, chicken, and turkey sequences is shown in Figures 15a and 15b. All 
sequences contain an RSRR sequence that is likely to represent the proteolytic processing 
1 5 site. Following this RSRR sequence, the sequences contain a C-terminal region that is 
100% conserved among all four species. The absolute conservation of the C-terminal 
region between species as evolutionarily far apart as humans and chickens, and baboons 
and turkeys, suggests that this region will be highly conserved in many other species as 
well. 

20 Similar methodology was used to obtain the nucleotide and amino acid sequences for 
baboon (SEQ ID NO: 1 8 and 1 9, respectively; Figure 14a); bovine (SEQ ID NO.20 and 
21, respectively; Figure 14b); turkey (SEQ ID NO:26 and 27, respectively; Figure 14c); 
porcine (SEQ ID NO:28 and 29, respectively; Figure 14f); and ovine (SEQ ID NO:30 
and 31, respectively; Figure 14g). 
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EXAMPLE 10 
GDF-11 HOMOLOGY IN MAMMALIAN SPECIES 

The overall homology between GDF-1 1 and GDF-8 based upon their respective amino 
acid sequence is approximately 92% (see for example, PCT/US95/08543, which is 
5 incorporated herein by reference). Thus, it is expected that animals expressing GDF-8 
and GDF-1 1 will display similar phenotypes. Similarly, animals having a disruption in 
a GDF-8 or GDF-1 1 gene will display similar phenotypes. The relationship of GDF-8 
to GDF- 1 1 will be further understood in light of the following examples, in which GDF- 
1 1 knockout mice were created. 

10 Like most other TGF-0 family member, GDF-1 1 also appears to be highly conserved 
across species. By genomic Southern analysis, homologous sequences were detected in 
all mammalian species examined as well as in chickens and frogs (Figure 16). In most 
species, the GDF-11 probe also detected a second, more faintly hybridizing fragment 
corresponding to the myostatin gene (McPherron et al., 1997). 

15 EXAMPLE 11 

GDF-11 KNOCKOUT MICE 

To determine the biological function of GDF-11, we disrupted the GDF-11 gene by 
homologous targeting in embryonic stem cells. A murine 129 SV/J genomic library was 
prepared in lambda FDCII according to the instructions provided by Stratagene (La Jolla, 

20 CA). The structure of the GDF-1 1 gene was deduced from restriction mapping and 
partial sequencing of phage clones isolated from the library. Vectors for preparing the 
targeting construct were kindly provided by Philip Soriano and Kirk Thomas. To ensure 
that the resulting mice would be null for GDF-1 1 function, the entire mature C-terminal 
region was deleted and replaced by a neo cassette (Figure 17a,b). Rl ES cells were 

25 transfected with the targeting construct, selected with gancyclovir (2 |iM) and G41 8 (250 
fig/ml), and analyzed by Southern analysis. Homologous targeting of the GDF-1 1 gene 
was seen in 8/155 gancyclovir/G418 doubly resistant ES cell clones. Following injection 
of several targeted clones into C57BL/6J blastocysts, we obtained chimeras from one ES 
clone that produced heterozygous pups when crossed to both C57BL/6J and 129/SvJ 
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females. Crosses of C57BL/6J/129/SvJ hybrid Fl heterozygotes produced 49 wild-type 
(34%), 94 heterozygous (66%) and no homozygous mutant adult offspring. Similarly, 
there were no adult homozygous null animals seen in the 129/SvJ background (32 
wild-type (36%) and 56 heterozygous mutant (64%) animals). 

5 To determine the age at which homozygous mutants were dying, we genotyped litters of 
embryos isolated at various gestational ages from heterozygous females that had been 
mated to heterozygous males. At all embryonic stages examined, homozygous mutant 
embryos were present at approximately the predicted frequency of 25%. Among hybrid 
newborn mice, the different genotypes were also represented at the expected Mendelian 

10 ratio of 1 :2:1 (34 +/+ (28%), 61 +/- (50%), and 28 -/- (23%)). Homozygous mutant mice 
were bom alive and were able to breath and nurse. All homozygous mutants died, 
however, within the first 24 hours after birth. The precise cause of death was unknown, 
but the lethality may have been related to the fact that the kidneys in homozygous 
mutants were either severely hypoplastic or completely absent. A summary of the 

1 5 kidney abnormalities in these mice is shown in Figure 18. 

EXAMPLE 12 

ANATOMICAL DIFFERENCES IN GDF-11 KNOCKOUT MICE 

Homozygous mutant animals were easily recognizable by their severely shortened or 
absent tails (Figure 19a). To further characterize the tail defects in these homozygous 

20 mutant animals, we examined their skeletons to determine the degree of disruption of the 
caudal vertebrae. A comparison of wild-type and mutant skeleton preparations of late 
stage embryos and newborn mice, however, revealed differences not only in the caudal 
region of the animals but in many other regions as well. In nearly every case where 
differences were noted, the abnormalities appeared to represent homeotic transformations 

25 of vertebral segments in which particular segments appeared to have a morphology 
typical of more anterior segments. These transformations, which are summarized in 
Figure 20, were evident throughout the axial skeleton extending from the cervical region 
to the caudal region. Except for the defects seen in the axial skeleton, the rest of the 
skeleton, such as the cranium and limb bones, appeared normal. 
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Anterior transformations of the vertebrae in mutant newborn animals were most readily 
apparent in the thoracic region, where there was a dramatic increase in the number of 
thoracic (T) segments. All wild-type mice examined showed the typical pattern of 13 
thoracic vertebrae each with its associated pair of ribs (Figure 19(b,e)). In contrast, 
5 homozygous mutant mice showed a striking increase in the number of thoracic vertebrae. 
All homozygous mutants examined had 4 to 5 extra pairs of ribs for a total of 1 7 to 18 
(Figure 19(d,g)) although in over 1/3 of these animals, the 18th rib appeared to be 
rudimentary. Hence, segments that would normally correspond to lumbar (L) segments 
LI to L4 or L5 appeared to have been transformed into thoracic segments in mutant 
10 animals. 

Moreover, transformations within the thoracic region in which one thoracic vertebra had 
a morphology characteristic of another thoracic vertebra were also evident. For example, 
in wild-type mice, the first 7 pairs of ribs attach to the sternum, and the remaining 6 are 
unattached or free (Figure 19(e,h)). In homozygous mutants, there was an increase in the 

15 number of both attached and free pairs of ribs to 10-11 and 7-8, respectively (Figure 
19(gj)). Therefore, thoracic segments T8, T9, T10, and in some cases even Til, which 
all have free ribs in wild-type animals, were transformed in mutant animals to have a 
characteristic typical of more anterior thoracic segments, namely, the presence of ribs 
attached to the sternum. Consistent with this finding, the transitional spinous process and 

20 transitional articular processes which are normally found on T10 in wild-type animals 
were instead found on T13 in homozygous mutants (data not shown). Additional 
transformations within the thoracic region were also noted in certain mutant animals. For 
example, in wild-type mice, the ribs derived from Tl normally touch the top of the 
sternum. However, in 2/23 hybrid and 2/3 129/SvJ homozygous mutant mice examined, 

25 T2 appeared to have been transformed to have a morphology resembling that of Tl ; that 
is, in these animals, the ribs derived from T2 extended to touch the top of the sternum. 
In these cases, the ribs derived from Tl appeared to fuse to the second pair of ribs. 
Finally, in 82% of homozygous mutants, the long spinous process normally present on 
T2 was shifted to the position of T3. In certain other homozygous mutants, asymmetric 

30 fusion of a pair of vertebrosternal ribs was seen at other thoracic levels. 
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The anterior transformations were not restricted to the thoracic region. The anterior most 
transformation that we observed was at the level of the 6th cervical vertebra (C6). In 
wild-type mice, C6 is readily identifiable by the presence of two anterior tuberculi on the 
ventral side. In several homozygous mutant mice, although one of these two anterior 
5 tuberculi was present on C6, the other was present at the position of C7 instead. Hence, 
in these mice, C7 appeared to have been partially transformed to have a morphology 
resembling that of C6. One other homozygous mutant had 2 anterior tuberculi on C7 but 
retained one on C6 for a complete C7 to C6 transformation but a partial C6 to C5 
transformation. 

10 Transformations of the axial skeleton also extended into the lumbar region. Whereas 
wild-type animals normally have only 6 lumbar vertebrae, homozygous mutants had 8-9. 
At least 6 of the lumbar vertebrae in the mutants must have derived from segments that 
would normally have given rise to sacral and caudal vertebrae as the data described 
above suggest that 4 to 5 lumbar segments were transformed into thoracic segments. 

15 Hence, homozygous mutant mice had a total of 33-34 presacral vertebrae compared to 
26 presacral vertebrae normally present in wild-type mice. The most common presacral 
vertebral patterns were C7/T18/L8 and C7/T18/L9 for mutant mice compared to 
C7/T13/L6 for wild-type mice. The presence of additional presacral vertebrae in mutant 
animals was obvious even without detailed examination of the skeletons as the position 

20 of the hindlimbs relative to the forelimbs was displaced posteriorly by 7-8 segments. 

Although the sacral and caudal vertebrae were also affected in homozygous mutant mice, 
the exact nature of each transformation was not as readily identifiable. In wild-type 
mice, sacral segments SI and S2 typically have broad transverse processes compared to 
S3 and S4. In the mutants, there did not appear to be an identifiable SI or S2 vertebra. 
25 Instead, mutant animals had several vertebrae that appeared to have morphology similar 
to S3. In addition, the transverse processes of all 4 sacral vertebrae are normally fused 
to each other although in newborns often only fusions of the first 3 vertebrae are seen. 
In homozygous mutants, however, the transverse processes of the sacral vertebrae were 
usually unfiised. In the caudalmost region, all mutant animals also had severely 
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malformed vertebrae with extensive fusions of cartilage. Although the severity of the 
fusions made it difficult to count the total number of vertebrae in the caudal region, we 
were able to count up to 15 transverse processes in several animals. We were unable to 
determine whether these represented sacral or caudal vertebrae in the mutants because 
5 we could not establish morphologic criteria for distinguishing S4 from caudal vertebrae 
even in wild-type newborn animals. Regardless of their identities, the total number of 
vertebrae in this region was significantly reduced from the normal number of approxi- 
mately 30. Hence, although the mutants had significantly more thoracic and lumber 
vertebrae than wild-type mice, the total number of segments was reduced in the mutants 
1 0 due to the truncation of the tails. 

Heterozygous mice also showed abnormalities in the axial skeleton although the 
phenotype was much milder than in homozygous mice. The most obvious abnormality 
in heterozygous mice was the presence of an additional thoracic segment with an 
associated pair of ribs (Figure 19(c,f)). This transformation was present in every 

1 5 heterozygous animal examined, and in every case, the additional pair of ribs was attached 
to the sternum (Figure 19(i)). Hence, T8, whose associated rib normally does not touch 
the sternum, appeared to have been transformed to a morphology characteristic of a more 
anterior thoracic vertebra, and LI appeared to have been transformed to a morphology 
characteristic of a posterior thoracic vertebra. Other abnormalities indicative of anterior 

20 transformations were also seen to varying degrees in heterozygous mice. These included 
a shift of the long spinous process characteristic of T2 by one segment to T3, a shift of 
the articular and spinous processes from Tl 0 to Tl 1 , a shift of the anterior tuberculus on 
C6 to C7, and transformation of T2 to Tl where the rib associated with T2 touched the 
top of the sternum. 

25 In order to understand the basis for the abnormalities in axial patterning seen in GDF-1 1 
mutant mice, we examined mutant embryos isolated at various stages of development and 
compared them to wild-type embryos. By gross morphological examination, homozy- 
gous mutant embryos isolated up to day 9.5 of gestation were not readily distinguishable 
from corresponding wild-type embryos. In particular, the number of somites present at 
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any given developmental age was identical between mutant and wild-type embryos, 
suggesting that the rate of somite formation was unaltered in the mutants. By day 
10.5-1 1.5 p.c, mutant embryos could be easily distinguished from wild-type embryos by 
the posterior displacement of the hindlimb by 7-8 somites. The abnormalities in tail 
5 development were also readily apparent at this stage. Taken together, these data suggest 
that the abnormalities observed in the mutant skeletons represented true transformations 
of segment identities rather than the insertion of additional segments, for example, by an 
enhanced rate of somitogenesis. 

Alterations in expression of homeobox containing genes are known to cause transforma- 

10 tions in Drosophila and in vertebrates. To see if the expression patterns of Hox genes 
(the vertebrate homeobox containing genes) were altered in GDF-1 1 null mutants we 
determined the expression pattern of 3 representative Hox genes, Hoxc-6, Hoxc-8 and 
Hoxc-1 1, in day 12.5 p.c. wild-type, heterozygous and homozygous mutant embryos by 
whole mount in situ hybridization. The expression pattern of Hoxc-6 in wild-type 

15 embryos spanned prevertebrae 8-15 which correspond to thoracic segments T1-T8. In 
homozygous mutants, however, the Hoxc-6 expression pattern was shifted posteriorly 
and expanded to prevertebrae 9-1 8 (T2-T1 1). A similar shift was seen with the Hoxc-8 
probe. In wild-type embryos, Hoxc-8 was expressed in prevertebrae 13-18 (T6-T1 1 ) but, 
in homozygous mutant embryos, Hoxc-8 was expressed in prevertebrae 1 4-22 (T7-T1 5). 

20 Finally, Hoxc-1 1 expression was also shifted posteriorly in that the anterior boundary of 
expression changed from prevertebrae 28 tin wild-type embryos to prevertebrae 36 in 
mutant embryos. (Note that because the position of the hindlimb is also shifted 
posteriorly in mutant embryos, the Hoxc-1 1 expression patterns in wild-type and mutant 
appeared similar relative to the hindlimbs). These data provide further evidence that the 

25 skeletal abnormalities seen in mutant animals represent homeotic transformations. 

The phenotype of GDF-1 1 mice suggested that GDF-1 1 acts early during embryogenesis 
as a global regulator of axial patterning. To begin to examine the mechanism by which 
GDF-1 1 exerts its effects, we determined the expression pattern of GDF-1 1 in early 
mouse embryos by whole mount in situ hybridization. At these stages the primary sites 
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of GDF-1 1 expression correlated precisely with the known sites at which mesodermal 
cells are generated. Expression of GDF-1 1 was first detected at day 8.25-8.5 p.c. (8-10 
somites) in the primitive streak region, which is the site at which ingressing cells form 
the mesoderm of the developing embryo. Expression was maintained in the primitive 
5 streak at day 8.75, but by day 9.5 p.c, when the tail bud replaces the primitive streak as 
the source of new mesodermal cells, expression of GDF-1 1 shifted to the tail bud. Hence 
at these early stages, GDF-1 1 appears to be synthesized in the region of the developing 
embryo where new mesodermal cells arise and presumably acquire their positional 
identity. 

The phenotype of GDF-1 1 knockout mice in several respects resembles the phenotype 
of mice carrying a deletion of a receptor for some members of the TGF-P superfamily, 
the activin type IIB receptor (ActRUB). As in the case of GDF-1 1 knockout mice, the 
ActRIBB knockout mice have extra pairs of ribs and a spectrum of kidney defects ranging 
from hypoplastic kidneys to complete absence of kidneys. The similarity in the 
phenotypes of these mice raises the possibility that ActRUB may be a receptor for 
GDF-1 1. However, Act RIIB cannot be the sole receptor for GDF-1 1 because the 
phenotype of GDF-1 1 knockout mice is more severe than the phenotype of ActRUB 
mice. For example, whereas the GDF-1 1 knockout animals have 4-5 extra pairs of ribs 
and show homeotic transformations throughout the axial skeleton, the ActRUB knockout 
animals have only 3 extra pairs of ribs and do not show transformations at other axial 
levels. In addition, the data indicate that the kidney defects in the GDF-1 1 knockout 
mice are also more severe than those in ActRIIB knockout mice. The ActRUB knockout 
mice show defects in left/right axis formation, such as lung isomerixm and a range of 
heart defects that we have not yet observed in GDF-1 1 knockout mice. ActRIIB can bind 
the activins and certain BMPs, although none of the knockout mice generated for these 
ligands show defects in left/right axis formation. 

If GDF-1 1 does act directly on mesodermal cells to establish positional identity, the data 
presented here would be consistent with either short range or morphogen models for 
GDF-1 1 action. That is, GDF-1 1 may act on mesodermal precursors to establish patterns 
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of Hox gene expression as these cells are being generated at the site of GDF-11 
expression, or alternatively, GDF-1 1 produced at the posterior end of the embryo may 
diffuse to form a morphogen gradient. Whatever the mechanism of action of GDF-1 1 
may be, the fact that gross anterior/posterior patterning still does occur in GDF-1 1 
5 knockout animals suggests that GDF-1 1 may not be the sole regulator of ante- 
rior/posterior specification. Nevertheless, it is clear that GDF-1 1 plays an important role 
as a global regulator of axial patterning and that further study of this molecule will lead 
to important new insights into how positional identity along the anterior/posterior axis 
is established in the vertebrate embryo. 

10 Similar phenotypes are expected in GDF-8 knockout animals. For example, GDF-8 
knockout animals are expected to have increased number of ribs, kidney defects and 
anatomical differences when compared to wild-type. 

Although the invention has been described with reference to the presently preferred 
embodiment, it should be understood that various modifications can be made without 
15 departing from the spirit of the invention. Accordingly, the invention is limited only by 
the following claims. 
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CLAIMS 

1 . A method of producing animal food products having an increased number of ribs 
comprising: 

a) introducing a transgene disrupting or interfering with expression of 
growth differentiation factor-8 (GDF-8) into an embryo into germ cells of a 
pronuclear embryo of the animal; 

b) implanting the embryo into the oviduct of a pseudopregnant female 
thereby allowing the embryo to mature to full term progeny; 

c) testing the progeny for presence of the transgene to identify transgene- 
positive progeny; 

d) cross-breeding transgene-positive progeny to obtain further transgene- 
positive progeny; and 

e) processing the progeny to obtain foodstuff. 

2. The method of claim 1, wherein the transgene comprises GDF-8 antisense 
polynucleotides. 

3. The method of claim 1, wherein the transgene comprises a gene encoding a 
dominant negative GDF-8 polypeptide. 

4. A method of producing avian, porcine or bovine food products having an 
increased number of ribs comprising: 

a) introducing a transgene disrupting or interfering with expression of 
growth differentiation factor-8 (GDF-8) into an embryo of an avian, porcine or 
bovine animal; 

b) culturing the embryo under conditions whereby progeny are hatched; 

c) testing the progeny for presence of the transgene to identify transgene- 
positive progeny; 

d) cross-breeding transgene-positive progeny; and 

e) processing the progeny to obtain foodstuff. 
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5. The method of claim 4, wherein the transgene comprises GDF-8 antisense 
polynucleotides. 

6. The method of claim 4, wherein the transgene comprises a gene encoding a 
dominant negative GDF-8 polypeptide. 

7. The transgenic animal of claim 4, wherein the transgene comprises a 
polynucleotide encoding a truncated GDF-8 polypeptide. 

8. A method of treating a chronic or acute renal disease in a subject having such a 
disease, comprising: 

administering to the subject, a reagent which affects GDF-8 activity or 

expression. 

9. The method of Claim 8, wherein the reagent is an agonist of GDF-8. 

10. The method of claim 8, wherein the reagent is an antagonist of GDF-8. 

\ i m The method of claim 10, wherein the antagonist is an antibody to GDF-8. 

12. The method of claim 10, wherein the antagonist is an antisense 

polynucleotide to GDF-8. 
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TCT CCC ATT AAT ATG CTA TAT TTT AAT GGC AAA CAA CAA ATA ATA TAT GGG AAA ATT CCA 
S ? I N M L Y r W G X Z Q I I Y G X X ? 

GCC ATG GTA GTA GAC CGG TGT GGG TGC TCC TGA GCT TTG CAT TAG CTT TAA. AAT TTC CCA 
A M V V D R C G C S 

AAT CCT GGA AGG TCT TCC CCT CGA TTT CGA AAC TGT GAA. TTT ATG TAC CAC AGG CTG TAG 



Rat GDF-8 
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7— G~\ G~\ AAG CCA CAA TTA TCC ATA TAC TTG AGG CAA CTC CAA AAA CCT ACA ACG GTG 
L VVKAQLWIYLRQVQK?TTV 

' CAG ATC CTC ACA CTC ATT AAG CCC ATG AAA GAC GGT ACA ACA TAT ACT GGA ATT 

V V C I L R L I K ? M K D C T R Y T G I 

GGA TCT TTG AAA CTT GAC ATG AAC CCA GGC ACT GGT ATC TGG CAG ACT ATT GAT GTG AAG 



i - fi. 



0 M N ? G T G I V C S 



ACA GTG CTG CAA AAT TGG CTC AAA CAG CCT GAA TCC AAT TTA GGC ATC CAA ATA AAA CCT 
T V 1 Q M W L X C ? ^ S N L C I t I K A 

— G-r GAG ACT GGA CGA GAT CTT CCT GTC ACA TTC CCA GGA CCG GGT GAA GAT GGA TTG 
rr£CGRD!AVTr?G?G£DGL 

AAC TTT TTA GAG GTC AG A GTT ACA GAC ACA CCG AAA CGG TCC CGC AG A GAT TTT GGC 

\ } "= *r \ £ V R V T D T P K R S R R D F G 

C-^ G-C TGT GAT GAG CAC TC.; ACG GAA TCC CGA TGT TGT CGC TAC CCG CTG ACA GTG GAT 
L D C D £KST£SRCCRY?LTVD 

TTC CAA GCT TTT GGA TGG GAC TGG ATT ATA GCA CCT AAA AGA TAC AAA CCC AAT TAC TGC 
r H ArGWDWlIA?KRYKANYC 

1 GGA G^A TGC GAA TTT GTG TTT CTA CAG AAA TAC CCG CAC ACT CAC CTG GTA CAC CAA 

m y c Z C z. r V F L Q X Y P H T U L V H C 

GCA AAT CCC AGA GGC TCA GCA GGC CCT TGC TGC ACA CCC ACC AAG ATG TCC CCT ATA AAC 
Atf?RCSAG?CC7PTKMS?IN 

A-G CTG TAT TTC AAT GGA AAA GAA CAA ATA ATA TAT GGA AAG ATA CCA GCC ATG GTT GTA 
MCYFNGKSQIIYGKIPAMVV 

GAT CGT TGC GGG TGC TCA TGA GGC TGT CGT GAG ATC CAC CAT TCG ATA AAT TGT GGA AGC 
D R C G C S 

CAC CAA AAA AAA AAG CTA TAT CCC CTC ATC CAT CTT TGA AAC TGT GAA ATT ACG TAC GCT 
AGG CAT TGC C 
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GDF-8 SRRDFGLDCDEHSTESFaCCRYPLTVDF-EAFGWD-WI lAFKRYKANYjCSGE&FVFLGKYP 

GDF - 1 RPRROAE PVLGGGPGG^lCRARRL YVSF-RE VGWHRWV I AFRGFLANYtiGGdCALPVALSCSCGPP 

EMP-2 REKRQAXHKQRKRLKSSCKRhRYVDF-SDra^ — 

EMP-4 KRSPKWSCRARKK.wta — 

Vqr - 1 SRGSGSSDYNGSELKTACKKHELWSF-CCLGWCOW 1 1 AFKGYAANYGDGFjciSFPLNAHMNA — 

CP-1 LRMANVAE NSSSDCRGAbkKHE L YVSF -ROIGWCOW 1 1 APEGYMYY^GFXAF PL NSYMNA — 

EMP-5 SRMSSVGDYNTSECKQAlCKKHEL WSF-RDLGOT 1 1 AFEGYAAF YCCGEJSr PL NAJ-MNA — 

EMP-3 EQTLKXAKRKCW1EPRNCARRYLKVDF-AD 1GWSEWI I SPKSFOAYYpSGAjdCFPMPKSLKPS— 

M 1 5 GFCRAGRSAGATAADGFjdALRELSVDL RAERSVL I PE TYOANM^CGVpG'AFQSORNPRY-- 

inhibina ALRLLGRPFEEPAAHANCHRVALN I ff-OELGWERW I VYFPSF I FHYjCHjGCpLH ! PPNLSLPV- 

lnhibinj?A HRRRRRG LECDGKV-N I jdCKKCf F VSF-KO 1 G'/^NDW 1 1 APSGYHAIWjGEGECFSH 1 AGTSGSSL- 

InhibinjJB HRIRKRGLECDGRT-NLClCRGGf F IDF-RL IGWNOWI ( AP TGYYGNYjDEGsCPAYL AGVPGSA5- 

TGF- B\ HRRALDTNYCFSSTEKNCbvRGL Y I DFRKDLGWK-W ! HEPKGYHANFC1GPCPY I WSLD 

IGF- 82 KKRALDAAYCFRWCDI^CCLRFLY 1DFKR0LG'M<-WI HEFKGYN^lF;cjA.GACPYL'/VSSD 

TGF- 33 KKRA10TNYCFRNLEE^b/RPLY10FRO)LGW-MPKGYYANFjGSGFgPYLRSA0 



GDF-8 -HTHLVHQANFRG SAGFCCjT-PTKMSP I toiLYF-NGXEQl 1 YGK I PAMWORijfs 

GOF-1 ALNHAVLRALMHA— AATCMDLFlGV— PARLSP 1 SVLFF-ONSO^LRQYEDMVVOEcdcR 

EMP-2 -TNHAIVQTLVNS — VNSKIPKACGV-PTELSAISMLYL-OENEKWLKWGDMVVEGCgqR 

EMP-4 -TNHA I VQTLVNS — VNSS 1 PKACCV— PTELSA 1 SMLYl-DEYDKWLKNYGEMWEGCGjCR 

Vgr-1 -TNHA I VQTLVHL— MNPEYVPKPjCCiA— PTKLNA I SVL YF-OONSNV I LKKYRNMWRAjCpCH 

CP-1 -TNHA 1 VQTLVHF— I NPETVPKPpdA— PTQLNA I SVLYF-ODSSNV I LKKYRNMVYRACGjCH 

EMP-5 -TNHA I VQTLVHL— WFPDHVPKPCdA— PTKLNA I SVL YF-DDSSNV 1 LKKYRNMWRSjCQCH 

EWP _3 — NHAT [QS I VRA-VGWPG 1 PEFCCW---PEKM5SLS [ LFFH)E^B<^I^Art.KVYPI^TVES^«W 

MIS -GNHWLLLKMQA — RGAALARPFCCV — PTAYAGKLL I SLSEER— I SAHHVPNMVATE.CDCR 

I nh ibina -PGAPPTPACPYS LLPGA^CMLPGTMRPLHVRTTSlX^YSFKYETVPNLLTCHC^l 

inhibin/JA -SFHSTV 1 NHYRMRGHSPF ANLKSCG/ — PTKLRPMSMLYY-OCCGN ! 1KXDIGNMIVEECQCS 

Inhibin JS8 -SFHTA x /VNQYRMRGLNPGT-VNSCql— PTXLSTMSMLYF-DDEYN IVKRDVPNM 1 VEEjOXA 

TGF- B 1 -TQYSKVLALYNO— HNTOASAAppO/— PQALEPLPlV'fY-VGRKPKV-EGLSNMlVRSCwCS 

TGF- 82 -TCHSRVLSLYNT— 1 NPEASASPCGV— SGOLEPLT I LYY- 1GKTPK I -EGLSNM 1 VKSjCKCS 

TGF- Bi -TTHSTVLGLYNT—LNPEASASFCCjV— PGDLEPLT I LYY-VGRTPKV-EGLSNMV/KSCKiK 

FIG.3c 
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1 G TC TC TCGGACGG TACATGCACTAATAT TTCACT TGGCAT TACTCAAAACCAAAAAG AAG 60 
61 AAATAAGAACAAGGGAAAAAAAAAGATTGTGCTGATTTrTAAAATGATGCAAAAACTGCA 120 

M M Q K L 0 

121 AATGTATGTTTATATTTACCTGTTCATGCTGATTGCTGCTCGCCCAGTGGATCTAAATGA 180 

MYVYIYLFML IAAGPVOLNE 
181 GGGCAG TGAG AG AGAAGAAMTG TGG AAAAAGAGGGGC TG TG T AATGC ATG TGCGTGGAG 240 

CSEREENVEKEGLCNACAWR 
241 ACAAAACACGAGGTACTCCAGAATAGAAGCCATAAAAATTCAAATCCTCAGTAAGCTGCG 300 

ONTRYSRIEAIKIOILSKLR 
301 CC TCGAAACAGC TCC TAACATCAGCAAAGATGCT AT AAGACAAC T TCTGCCAAGAGCGCC 360 

L E T A P [N^T."$1 KDAIRQLIPRAP 
36 1 TCCACTCCGGGAACTGATCGATCAGTACGACGTCCAGAGGGATGACAGCAGTGATGGCTC 420 

PLRELIOQYOVOROOSSOGS 
42 1 TTTGGAAGATGACGATTATCACGCTACCACGGAAACAATCATTACCATGCCTACAGAGTC 480 

LEDDDYHATTET I I TMPTES 
481 TGAC T T TCT AATGCAAGCGGATGGCAAGCCCAAATG T TGCT TT TT TAAAT TTAGCTC TAA 540 

OF IMQADGKPKCCFFKFSSK 
541 AA TACAG TACAACAAAG TAG TAAAAGCCCAACTG TGGATATATCTCAGACCCG TCAAGAC 600 

10YNKVVKAQLWI YLRPVKT 
601 TCCTACAACAGTGTTTGTGCAAATCCTGAGACTCATCAAACCCATGAAAGACGGTACAAG 660 

PTTVFVOILRL IKPMKOGTR 
661 GTATACTGGAATCCGATCTCTGAAACTTGACATGAGCCCAGGCACTGGTATTTGGCAGAG 720 

YTGIRSLKL0M5PGTGIWQS 
721 TATTGATGTGAAGACAGTGTTGCAAAATTGGCTCAAACAGCCTGAATCCAACTTAGGCAT 780 

IDVKTVLQNWLKQPESNLG 1 
781 TGAAATCAAAGCTTTGGATGAGAATGGCCATGATCTTGCTGTAACCTTCCCAGGACCAGG 840 

E1KAL0ENGHDLAVTFPGPG 
841 AGAAGATGGGCTGAATCCCT TT T TAG AAG TCAAGGTGACAGACACACCCAAGAGGTCCCG 900 

EDGLNPFLEVKVTDTP K I R S R 
901 GAGAGACTTTGGGCTTGACTGCGATGAGCACTCCACGGAATCCCGGTGCTGCCGCTACCC 960 

T]D FGLOCDEHSTESRCCRYP 
961 CCTCACCGTCGATTTTGAAGCCTTTCGATGCGACTCCATTATCGCACCCAAAAGATATAA 1020 

LTVOFEAFGWDWI IAPKRYK 
1021 GGCCAATTACTGCTCAGGAGAGTGTGAATTTGTGTTTTTACAAAAATATCCGCATACTCA 1080 

ANYCSGECEFVFLQKYPHTH 
1 08 1 TCTTGTGCACCAAGCAAACCCCAGAGGCTCAGCAGGCCCTTGCTGCACTCCGACAAAAAT 1 1 40 

LVHQANPRGSAGPCCTPTKM 
1141 GTCTCCCATTAATATGCTATATTTTAATGGCAAAGAACAAATAATATATGGGAAAATTCC 1200 

SPINMLYFNGKEQi IYGK1P 
1 201 AGCCATGGT AGT AGACCGCTGTGGGTGCTCATGAGCT T TGCAT T AGGT TAGAAACT TCCC 1 260 

AMVVDRCCCS* 

FIG.5a 
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126'. AAGTCATGGAAGGTCTTCCCCTCAATTTCGAAACTGTGAATTCAAGCACCACAGGCTGTA 1320 

1321 GGCCTTGAGTATGCTCTAGTAACGTAAGCACAAGCTACAGTGTATGAACTAAAAGAGAGA 1380 

1381 ATAGATGCAATGGTTGGCATTCAACCACCAAAATAAACCATACTATAGGATGTTGTATGA 1440 

1441 T T TCCAG ACT T T T TGAMT AGATGG AGATCAMT T ACAT T TATG TCCAT ATATG TAT AT T 1500 

1501 ACAACTACAATCTAGGCAAGGAAGTGAGAGCACATCTTGTGGTCTGCTGAGTTAGGAGGG 1560 

1561 TATGATTAAAAGGTAAAGTCTTATTTCCTAACAGTTTCACTTAATATTTACAGAAGAATC 1620 

1621 TATATGTAGCCTTTGTAAAGTGTAGGATTGTTATCATTTAAAAACATCATGTACACTTAT 1680 

1681 ATTTGTATTGTATACTTGGTAAGATAAAATTCCACAAAGTAGGAATGGGGCCTCACATAC 1740 

1741 ACATTGCCATTCCTATTATAATTGGACAATCCACCACGGTGCTAATGCAGTGCTGAATGG 1800 

1801 CTCCTACTGGACCTCTCGATACAACACTCTACAAAGTACGAGTCTCTCTCTCCCTTCCAG 1860 

1861 GTGCATCTCCACACACACAGCACTAAGTGTTCAATGCATTTTCTTTAAGGAAAGAAGAAT 1920 

1921 CTTTTTTTCTAGAGGTCAACTTTCAGTCAACTCTAGCACAGCGGGAGTGACTGCTGCATC 1980 

1981 TTAAAAGGCAGCCAAAoaGiATTCATTTTTTAATCTAMTTTCAAAATCACTGTCTGCCT 2040 

2041 TTATCACATGGCAATTTTGTGGTAAAATAATGGAAATGACTGGTTCTATCAATATTGTAT 2100 

2101 AAAAGACTCTGAAACAATTACATTTATATAATATGTATACAATATTGTTTTGTAAATAAG 2160 

2161 TGTCTCCTTTTATATTTACTTTGGTATATTTTTACACTAATGAAATTTCAAATCATTAAA 2220 

2221 GTACAAAGACATGTCATGTATCACAAAAAAGGTGACTGCTTCTATTTCAGAGTGAATTAG 2280 

2281 CAGATTCAATAGTGGTCTTAAAACTCTGTATGTTAAGATTAGAAGGTTATATTACAATCA 2340 

234 1 ATTTATGTATTTTTTACATTATCAACTTATGGTTTCATGGTGGCTGTATCTATGAATGTG 2400 

2401 GCTCCCAGTCAAATTTCMTGCCCCACCATTTTAAAAATTACAAGCATTACTAAACATAC 2460 

2461 CAACATGTATCTAAAGAAATACAAATATGGTATCTCAATAACAGCTACTTTTTTATTTTA 2520 

2521 TAATTTGACAATGAATACATTTCTTTTATTTACTTCAGTTTTATAAATTGGAACTTTGTT 2580 

2581 TATCAMTGTATTGTACTCATAGCTAAATGAAATTATTTCTTACATAAAMTGTGTAGAA 2640 

2641 ACTATAAATTAAAGTGTTTTCACATTTTTGAAAGGC 2676 

FIG.5b 
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AAGAAAAGTAAAAGGAAGAAACAAGAACAAGAAAAAAGATTATATTGATTTTAAAATCAT 60 

M 

GCAAAAAC TGCAAC TC TGTG T T TAT AT TTACCTGTTTATGCTGATTGT TGC TGG TCCAC T 120 
QKLQLCVYIYLFML IVAGPV 

GGATCTAAATGAGAACAGTGAGCAAAAAGAAAATGTGGAAAAAGAGGGGCTGTGTAATGC 180 
DLNENSEQKENVEKEGLCNA 

ATGTACTTGGAGACAAAACACTAAATCTTCAAGAATAGAAGCCATTAAGATACAAATCCT 240 
CTWRQNTKSSRIEAIKIQIL 

CAGTAAACTTCGTCTGGAAACAGCTCCTAACATCAGCAAAGATGTTATAAGACAACTTTT 300 



S K L R L E T A P .N I - S K 0 V I R Q L L 



ACCCAAAGCTCCTCCACTCCGGGAACTGATTGATCAGTATGATGTCCAGAGGGATGACAG 360 

PKAPPLREL 1 DQYDVORDOS 
CAGCGATGGCTCTTTGGAAGATGACGATTATCACGCTACAACGGAAACAATCATTACCAT 420 

SOGSLEDDOYHATTETi ITM 
GCC T ACAG AG TC TG AT T T TCTAATGCAAG TGGATGGAAAACCCAMTG TTGCTTCTTTAA 480 

PTESDFLMOVDGKPKCCFFK 
ATT T AGC TCT AAAATACAATACAATAAAC TAG TAAAGGCCCAAC TATGG AT ATAT T TG AG 540 

FSSK IQYNKVVKAQLWIYLR 
ACCCGTCGAGACTCCTACAACAGTGTTTGTGCAAATCCTGAGACTCATCAAACCTATGAA 600 

PVE TPTTVFVQILRL IKPMK 
AGACGG TACAAGG TATACTGGAATCCGATCTCTGAAAC T TGACATGAACCCAGGCAC TGG 660 

DGTRYTGIRSLKLDMNPGTG 
TATTTGGCAGAGCATTGATGTGAAGACAGTGTTGCAAAATTGGCTCAAACAACCTGAATC 720 

I WQS IDVKTVLQNWLKQPES 
CAACTTAGGCATTGAAATAAAAGCTTTAGATGAGAATGGTCATGATCTTGCTGTAACCTT 780 

NLG IE IKALOENGHDLAVTF 
CCCAGGACCAGGAGAAGATGGGCTGAATCCGTTTTTAGAGGTCAAGGTAACAGACACACC 840 

PGPGEDGLNPFLEVKVTDTP 
AAAAAGATCCAG AAGGGAT TTTGG TC T TG ACTG TGATGAGCAC TCAACAGAATCACG ATG 900 



K |R S R Rl OFGLOCOEHSTESRC 
CTGTCGTTACCCTCTAACTGTGGATTTTGAAGCTTTTGGATGGGATTGGATTATCGCTCC 960 

CRYPLTVOFEAFGWOWI 1 AP 
TAAAAGATATAAGGCCAATTACTGCTCTGGAGAGTGTGAATTTGTATTTTTACAAAAATA 1020 

KRYKANYCSGECEFVFLOKY 
TCCTCATACTCATCTGGTACACCAAGCAAACCCCAGAGGTTCAGCAGGCCCTTGCTGTAC 1080 

PHTHLVHQANPRGSAGPCCT 
TCCCACAMGATGTCTCCAATTAATATGCTATATTTTAATGGCAAAGAACAAATAATATA 1 1 40 

PTKMSPINMLYFNGKEQI I Y 
TGGGAAAATTCCAGCGATGGTAGTAGACCGCTGTGGGTGCTCATGAGATTTATATTAAGC 1200 

GK i PAMVVORCGCS * 



FIG.5c 
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1201 G T TC A T AAC T TCC T AAAACATGGAAGG T T T TCCCCTCAACAAT T T TGAAGC TG TG AAAT T 1260 

1261 AA.GTACCACAGGCTATAGGCCTAGAGTATGCTACAGTCACTTAAGCATAAGCTACAGTAT 1320 

1321 GTAAACTAAAAGGGGGAATATATGCAATGGTTGGCATTTAACCATCCAAACAAATCATAC 1380 

1381 AAGAAAGTTTTATGATTTCCAGAGTTTTTGAGCTAGAAGGAGATCAAATTACATTTATGT 1440 

1441 TCCTATATATTACAACATCGGCGAGGAAATGAAAGCGATTCTCCTTGAGTTCTGATGAAT 1500 

1501 TAAAGGAGTATGCTTTAAAGTCTATTTCTTTAAAGTTTTGTTTAATATTTACAGAAAAAT 1560 

1561 CCACATACAGTATTGGTAAAATGCAGGATTGTTATATACCATCATTCGAATCATCCTTAA 1620 

1621 ACACTTGAATTTATATTGTATGGTAGTATACTTGGTAAGATAAAATTCCACAAAAATAGG 1680 

1631 GATGGTGCAGCATATGCAATTTCCATTCCTATTATAATTGACACAGTACATTAACAATCC 1740 

1741 ATGCCAACGGTGCTAATACGATAGGCTGAATGTCTGAGGCTACCAGGTTTATCACATAAA 1800 

1801 AAACATTCAGTAAAATAGTAAGTTTCTCTTTTCTTCAGGTGCATTTTCCTACACCTCCAA 1860 

1861 ATGAGGAATGGATTTTCTTTAATGTAAGAAGAATCATTTTTCTAGAGGTTGGCTTTCAAT 1920 

1921 TCTGTAGCATACTTGGAGAAACTGCATTATCTTAAAAGGCAGTCAAATGGTGTTTGTTTT 1980 

1981 TATCAAMTGTCAAAATAACATACTTGGAGAAGTATGTAATTTTGTCTTTGGAAAATTAC 2040 

2041 AACACTGCCTTTGCAACACTGCAGTTTTTATGGTAAAATAATAGAAATGATCGACTCTAT 2100 

2101 CAATATTGTATAAAAAGACTGAAACAATGCATTTATATAATATGTATACAATATTGTTTT 2160 

2161 GTAAATAAGTGTCTCCTTTTTTATTTACTTTGGTATATTTTTACACTAAGGACATTTCAA 2220 

2221 ATTAAGTACTAAGGCACAAAGACATGTCATGCATCACAGAAAAGCAACTACTTATATTTC 2280 

2731 AGAGCAAATTAGCAGATTAAATAGTGGTCTTAAAACTCCATATGTTAATGATTAGATGGT 2340 

2341 TATATTACAATCATTTTATATTTTTTTACATGATTAACATTCACTTATGGATTCATGATG 2400 

2401 GCTGTATAAAGTGAATTTGAAATTTCAATGGTTTACTGTCATTGTGTTTAAATCTCAACG 2460 

2461 TTCCATTATTTTAATACTTGCAAAAACATTACTAAGTATACCAAAATAATTGACTCTATT 2520 

2521 ATCTGAAATGAAGAATAAACTGATGCTATCTCAACAATAACTGT T ACT T T TAT T T T AT AA 2580 

2581 TTTGATAATGAATATATTTCTGCATTTATTTACTTCTGTTTTGTAAATTGGGATTTTGTT 2640 

2641 AATCAAATTTATTGTACTATGACTAAATGAAATTATTTCTTACATCTAATTTGTAGAAAC 2700 

2701 AGTATAAGTTATATTAAAGTGTTTTCACATTTTTTTGAAAGAC 2743 
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1 fyMQKLGMYVY 1 YLFML I MGPVDL^GSEREENVEKEGLCNACAWRQNTR 50 
1 MQKLQLCVY 1 YLFML 1 VAGPVOLNENSEQKENVEKEGLCNACTWRQNTK 49 
51 YSR 1 E A I K I Q I LSKLRLE T APN 1 5K0A I RQL LPRAPPLRE L 1 OQYDVGRD 100 

iimiiiiiiiiiiiiimiiii mm mimiimim 

50 SSR I EA I K I Q 1 LSKLRLE T APN I SKDV I RQLLPKAPPLREL I DOYOVQRD 99 
101 0SS0GSLE000YHATTET 1 i TMPTESDFLMQADGKPKCCFFKFSSKIQYN 150 

iiiiiiiiimiimmimmmi imimmiiiim t 

100 OSSOGSLEDODYHATTET i I TMPTES0FLMQV0GKPKCCFFKFSSKIOYN 149 
151 KWKAQLW1 YLRPVKTPTTVF VQ 1 LRL I KPMKDGTRYTG I RSLKLOMSPG 200 

iiiiiiimim imimmmmmmmmm n 

150 KWKAQLWI YLRPVETPTTVFVQ ! LRL I KPMKCGTRYTG IRSLKLOMNPG 199 
201 TG 1 WQS IDVKTVLQNWLKQPESNLG 1 E 1 KALDENGHDLAVTFPGPGEDGL 250 

iiiiiiiiiiiiiimiimiimmmmmimmmi _ 

200 TG IWQS IDVKTVLQNWLKQPESNLG IE IKALDENGHDLAVTFPGPGEDGL 249 
251 NPFLEVKVTOTPKRSRRDFGLDCOEHSTESRCCRYPLTVOFEAF GWOWI I 300 

miiiiimiimmmimmmimmimiimii oqq 

250 NPF LE VKVTOTPKRSRROFGLDCOEHSTESRCCRYPL TVOFE AF GWDW 1 1 299 
301 APKRYKANYCSGECEFVFLQKYPHTHLVHQANPRGSAGPCCTPTKMSPIN 350 

iiiiiiiiiiiiiiiiiiiiiimimmmmmmmm 

300 APKRYKANYCSGECEFVF LQKYPHTHLVHQANPRGSAGPCCTPTKMSP I N 349 
351 MLYFNGKEQI IYGK1PAMW0RCGCS 376 
350 MLYFNGKEQI IYGKIPAMWORCGCS 375 
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TICURE IVa 

1/1 31/11 

atc caa aaa ctc caa ctc tct ctt tat att tac ctc ttt atc ctg att ctt cct cct cca 

MQKUCLCVYITLFMLIVACP 

o*./:: 9i/3i 

CTC CAT CTA AAT CAC AAC ACT CAC CAA AAA CAA AAT CTC GAA AAA CAC CCC CTC TCT AAT 
V D L N £ M S ^CKEKV£K£CLCH 

121/4: isi/si 

CCA TCT ACT TCC AC A CAA AAC ACT AAA TCT TCA ACA ATA CAA CCC ATT AAA ATA CAA ATC 
ACT W * C NT KSSRrZAlKIQl 
131/oi 211/71 

CTC ACT AAA CTT CCT CTC CAA ACA CCT CCT AAC ATC ACC AAA CAT CCT ATA ACA CAA CTT 
LSXLaLIT.APNISKOAIRO'L 
:U/31 271/91 

TTA CCC AAA CCT CCT CCA CTC CCC CAA CTC ATT CAT CAC TAT CAT CTC CAC ACC CAT CAC 

L ? X A ? ? L ?. £l:dqyovq%oo 
301/101 331/111 

ACC ACC CAT CCC TCT TTC CAA CAT CAC CAT TAT CAC CCT ACA ACC CAA ACA ATC ATT ACC 
SSOCSLtD 30YKATTETIIT 
351/121 391/131 

ATC CCT ACA CAC TCT CAT TTT TTA ATC CAA CTC CAT CCA AAA CCC AAA TCT TCC TTC TTT 
« P T E S D F LMQVDCKPKCCr? 
421/141 451/151 

AAA TTT ACC TCT AAA ATA CAA TAC AAT AAA CTC CTA AAC CCC CAA CTA TCC ATA TAT TTC 
KcSSXIQY MXVVXAQCWIYL 

431/is: 5:i/:7: 

ACA CCC CTC CAC ACT CCT ACA ACA CTC TTT CTC CAA ATC CTC ACA CTC ATC AAA CCT ATC 
?.?•*£ T ? T T V F V Q : L * L Z X ? M 
=41/131 571/191 

AAA CAC CCT ACA ACC TAT ACT CCA ATC CCA TCT CTC AAA CTT CAC ATC AAC CCA CCC ACT 

x c . c t a r t c : rslx l d « n ? c t 

501/201 631/211 

ctt att tcc cac acc att cat ctc aac aca ctc ttc caa aat tcc ctc aaa caa cct caa 
c:-cs:cv:<tvlcnwlxqpe 

€51/221 691/231 

TCC AAC TTA CCC ATT CAA ATA AAA CCT TTA CAT CAC AAT CCT C\T CAT CTT CCT CTA ACC 
S N L C I E I KALCENCHCLAVT 
711/241 7;1/251 

TTC CCA CCA CCA CCA CAA CAT CCC CTC AAT CCC TTT TTA CAC CTC AAC CTA ACA CAC ACA 
FPCPGICC LMPFLZVKVTDT 
731/251 811/271 

CCA A 1 A ACA TCC ACA ACC CAT TTT CCT CTT CAC TCT CAT CAC CAC TCA ACA CAA TCC CCA 
PXRSRADFGLDCDEHSTESR 
341/231 671/291 

TCC TCT CCT TAC CCT CTA ACT CTC CAT TTT CAA CCT CTT CCA TCC CAT TCC ATT ATC CCT 
CCaY PLTVDFEACCWDWIlA 
901/301 931/311 

CCT AAA ACA TAT AAC CCC AAT TAC TCC TCT CCA CAC TCT CAA TTT CTA TTT TTA CAA AAA 

?XRYXANrCSCICErVFLQX 
951/321 991/321 

TAT CCT CAT ACT CAT CTC CTA CAC CAA CCA AAC CCC ACA CCT TCA CCA CCC CCT TCC TCT 
V ? K T H L V H C A N ? A C S A C ?" C C 
1C21/341 1051/351 

ACT CCC ACA AAC ATC TCT CCA ATT AAT ATC CTA TAT TTT AAT CCC AAA CAA CAA ATA ATA 

" ? ~ x M s ? : II y. L Y ? M C X - r T 
1C31/351 1111/371 

TAT CCC AAA ATT CCA CCC ATC CTA CTA CAC CCC TCC CCC TCC TCA TCA 
YCKIPAMVVORCCCS* 
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FICTILE 14b 



1/1 31/tl 

ATC CU AAA CTC CW ATC TCT CTT TAT ATT TAC CTA TTT ATC CTC ATT CTT CCT CCC CCA 
wCKLCISVYIYLFML-IVACP 

CTC CAT CTC AAT CAC AAC ACC CAC CAC AAC CAA AAT CTC CAA AAA CAC CCC CTC TCT AAT 
C ^N£NSEQK£NVEICECLCW 

121/41 isi/st 

CCA TCT TTC TCC ACC CAA AAC ACT ACA TCC TCA ACA CTA CAA CCC ATA AAA ATC CAA ATC 

f C L W R £ N T T S S R LEA I K I Q I 
131/51 211/71 

CTC ACT AAA CTT CCC CTC CAA ACA CCT CCT AAC ATC ACC AAA CAT CCT ATC ACA CAA CTT 

t " $ £ L R L E T A ? N I S K OA I R Q L 
241/31 271/91 

rrc ccc aac cct err cca ctc ere caa ctc att cat cac ttc cat ctc cac aca cat ccc 

I ? X A ? ? LLELICQFDVQROA 
3C1/1S1 331/111 

ACC ACT CAC CCC TCC TTC CAA CAC CAT CAC TAC CAC CCC ACC ACC CAA ACC CTC ATT ACC 
SSCCSLSCOOYKARTETVIT 

2$i/i:i 391/13 1 

ATC CCC ACC CAC TCT CAT CTT CTA ACC CAA CTC CAA CCA AAA CCC AAA TCT TCC TTC TTT 
w = T£SDLLTQVECKP3CCCF? 
421/141 4S1/1S1 

AAA TTT ACC TCT AAC ATA CAA TAC AAT AAA CTA CTA AAC CCC CAA CTC TCC ATA TAT CTC 
x T SS3CIQY MK&.VXAQLW IY L. 

43:/:=-; su/171 

*. -C CCT CTC ACT CCT CCC ACA CTC TTT CTC CAA ATC CTC ACA CTC ATC AAA CCC ATC 

:•: t ? a t v f v r : Las, : z ? x 
=;i/i:i s?i/i?i 

AAA CAC CCT ACA ACC TAT ACT CCA ATC CCA TCT CTC AAA CTT CAC ATC AAC CCA CCC ACT 
*<OCTRYTC IP. SLKLCXXPCT 
601/2C1 631/311 

CCT ATT TCC CAC ACC ATT CAT CTC AAC ACA CTC TTC CAC AAC TCC CTC AAA CAA CCT CAA 
C3-CS : DVXTV LQMWLKQ? E 
fiol/231 591/231 

TCC AAC TTA CCC ATT CAA ATC AAA CCT TTA CAT CAC AAT CCC CAT CAT CTT CCT CTA ACC 
SN LCIE I XALOENCHD L A V T 
721/341 751/251 

TTC CCA CAA CCA CCA G.-A CAT CCA CTC ACT CCT TTT TTA CAA CTC AAC CTA ACA CAC ACA 
F P Z ? C Z t C I. T ? ? L ZVK V T 0 T 
731/251 311/271 

CCA AAA ACA TCT ACC ACA CAT TTT CCC CTT CAT TCT CAT CAA CAC TCC ACA CAA TCT CCA 
PXRSAROrCLOCDEHSTESR 
841/231 871/291 

TCC TCT CCT TAC CCT CTA ACT CTC CAT TTT CAA CCT TTT CCA TCC CAT TCC ATT ATT CCA 
C C AYPLTVDFEArCWDWIIA 
SC1/3C1 931/311 

CCT AAA ACA TAT AAC CCC AAT TAC TCC TCT CCA CAA TCT CAA TTT CTA TTT TTC CAA AAC 
PKRYiCANYCSCECEFVFLCX 
? -51/321 991/331 

TAT CCT CAT ACC CAT CTT CTC CAC CAA CCA AAC CCC ACA CCT TCA CCC CCC CCC TCC TCT 
YPKTHLVKQANPRC SAC PCC 
1C21/34 1 i:si/3Si 

ACT CCT ACA AAC ATC TCT CCA ATT AAT ATC CTA TAT TTT AAT CCC CAA CCA CAA ATA ATA 
T ? T X K S ? I NMLYF NCcCQl 1 
1C31/35 1 1111/371 

TAC CCC A\C ATT CCA CCC ATC CTA CTA CAT CCC TCT CCC TCT TCA TCA 
f C X I ? A M V V C R C C C S 
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FIGURE 14c 

1/1 

ATC CAA AAC CTA CCA CTC TAT CTT TAT ATT TAC CTC TTC ATG CAC ATC CCC CTT CAT CCC 
wQKLAVlfVYIYLFMQlAVDP 
61/21 91/31 

CTC CCT CTC CAT CCC ACT ACT CAC CCC ACA CAC AAC CCT CAA AAA CAC CCA CTC TCC AAT 
-/ * Z C S S Q P T £ N A £ KDCLCK 

121/41 isi/si 

CCT TCT ACC TCC ACA CAC AAT ACA AAA TCC TCC ACA ATA CAA CCC ATA AAA ATT CAA ATC 
^"*CTV*C?fTKSSRIEAIKlQI 

iai/fii 2X1/71 

CTC ACC AAA CTC CCC CTC CAA CAA CCA CCT AAC ATT ACC ACC CAC CTT ATT AAC CAC CTT 
L **" SKL^-E-CAPMISROVIXQL 
: . :/31 271/91 

~"A CCC AAA CCT CCT CCA CTC CAC CAA CTC ATT CAT CAC TAT CAT CTC CAC ACC CAC CAC 
L* p X A ? ? L Q E LI DQY D V Q RO 0 
301/101 331/111 

ACT ACC CAT CCC TCT TTC CAA CAC CAT CAC TAT CAT CCC ACA ACC CAC ACC ATT ATC ACA 
SS0CSLEDOOYHATTETIIT 
351/121 391/131 

ATC CCT ACC CAC TCT CAT TTT CTT CTA CAA ATC CAC CCA AAA CCA AAA TCT TCC TTC TTT 
j/pTES C F L VQMECK P K C C F F 
421/141 4S1/1S1 

AAC TTT ACC TCT AAA ATA CAA TAT AAC AAA CTA CTA AAC CCA CAA TTA TCC ATA TAC TTC 
;< F S SK I Q Y XVVKA C L W IY L 

451/151 ~ £11/171 

ACC CAA CTC CAA CCT ACA ACC CTC TTT CTC CAC ATC CTC ACA CTT ATT AAC CCT ATC 

t r V C >* ? T T V ? V C I L A I- I X r K 
V; 1/131 £71/151 

*AA CAC CCT ACA ACA TAT ACT CCA ATT CCA TCT TTC AAA CTT CAC ATC AAC CCA CCC ACT 
SDCTRYTC I RSLKLDMNPCT 
□C1/2C1 £31/211 

CCT ATT TCC CAC ACT ATT CAT CTC AAC ACA CTC CTC CAA AAT TCC CTC AAA CAC CCT CAA 
CIWQS: CVXTVLQNWLXQ? Z 
££1/221 631/231 

tcc aat tta ccc atc caa ata aaa cct ttt cat cac act cca cca cat ctt cct ctc aca 
snlcce: kafdetcrolavt 

721/241 721/2*1 

CTA CCA CCC C - T CAA CAT CCA TTC AA.C CCA TTT TTA CAC CTC ACA CTT ACA CAC ACA 
j o C ? C Z C C L.MPFLEVRVTDT 
731/251 311/271 

CCC AAA CCC TCC CCC ACA CAT TTT CCC CTT CAC TCT CAT CAC CA* 2CA ACC CAA TCC CCA 
r 5 R S R ?. CFCLOCDEHSTESR 
341/231 871/291 

TCT TCT CCC TAC CCC CTC ACA CTC CAT TTC CAA CCT TTT CCA TCC CAC TCC ATT ATA CCA 
C*CRY *?LTVCFEA FCWDWIIA 
9C1/301 931/311 

CCT AAA ACA TAC AAA CCC AAT TAC TCC TCC CCA CAA TCC CAA TTT CTC TTT CTA CAC AAA 
PKRYXA^YCSCECErVFLQX 
951/321 991/331 

TAC CCC CAC ACT CAC CTC CTA CAC CAA CCA AAT CCC ACA CCC TTA CCA CCC CCT TCC TCC 

r ? h t :-: *. v k c a a p r c s a c ? c c 

1221/341 1351/35 1 

ACA CCC ACC AAC ATC TCC CCT ATA AAC ATC CTC TAT TTC AAT CCA AAA CAA CAA ATA ATA 

t ? t :< m s ? : nmlyfnckeq: 2 
::3i/3fii 1111/371 

TAT CCA AAC ATA CCA CCC ATC CTT CTA CAT CCT TCC CCC TCC TCA TCA 
YCKIPAMVVDRCCCS 
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fictot i4d 



1/1 

ATC ATT CAA AU> CZZ 
M I Q ^ ? 
Sl/21 

CCA CTC CAT CTA AAT 
d V D L N 

::i/4t 

aat ccc tct ccc tcc 

M A C A 
131/61 

ATC ctc a-, ctc 
: c s K L 
;;i/6i 

err ctc ccc aca ccc 

L L ? R A 
3C1/10I 

CAC ACC Aw. CAC CCC 
D S 5 C C 
351/121 ^ 

T H ? T I 

<:i/m 

~ . 7 AAA * • * AC*. ... 

r k * ? s s 

ctc ac a ccc err aac 

l a a v x 
5;:/i3*. 

ATC AAA CAC CCT ACA 

X K 0 C T 
SCI/201 

ACT CCT ATT TCn* CAw 

t c : W Q 

CAA TCC AAC TTA CCC 
I S K L C 
711/241 

ACC TTC CCA CCA CCA 



caa at: 
Q « 



CTC CTC ATT CCT CCT CCC 
V L I A A C 



CAC AC 
D S 



7 51/251 

ACA CCT AAC ACC * - — 
7 P JC R S 
S^l/231 

a c c r y 

501/301 

CCA CCC AAA ACA TAT 
A ? K R T 

951/321 

AAA TAT CTC CAT ACT 
T. Y ? H T 

i::i/:;: 

TCC ACC CCA ACA AAA 
C T ? T X 

: :3i/3o : 

ATA TAT CCC AAA ATT 

: y c k : 



31/U 

CTT TAT ATT TAC CTC T 
V Y I Y L F 
91/31 

CAC ACA CAC CCC AAT CTC CAA AAA CAC CCC CTC TCT 
£REANV£KECLC 
151/51 

ACA CAA AAC ACA ACC TAC TCC ACA ATA CAA CCC ATA AAA ATT CAA 

a C n t ?. y s a : £ a I :< : Q 

211/71 

ccc ctc caa aca ccc cct aac atc acc aaa cat cct ata aca caa 
al£ta?n:skda:rq 

271/91 

CCT CCA CTC CCC CAA CTC ATC CAT CAC TAC CAC CTC CAC ACC CAT 
?? La£LICCYDVC.RD 
331/111 

TCT TTC CAA CAT CAC CAT TAT CAC CCT ACC ACC CAA ACA ATC ATT 
SLED2DYHATTETII 
391/131 

TCT CAC TTT CTA ATC CAA CCC CAT CCA AAC CCC AAA TCT TCC TTT 
SGrLMCACCZPXCCr 
451/151 

AAA ATA CAC TAC AAC AAA CTC CTA AAC CCC CAC CTC TCC ATA TAT 
RIQYKKVVXAQLVIY 

511/171 

ACT CTT ACA ACA CTC TTT CTC CAA ATC CTC ACA CTC ATC AAA CCC 
TPTTVFVCrt.?. LIS? 
571/191 

ACC TAT ACC CCA ATC CCA TCT CTC AAA CTT CAC ATC ACC CCA CwC 
RYTCCRSLXLDKSPG 
631/211 

ACT ATT CAT CTC AAC ACA CTC TTC CAA AAT TCC CTC AAA CAC CCT 
SXDVK7VLQ-NWLXQ? 
o51/231 

ATT CAA ATT AAA CCT TTC CAT CAC AAT CCC CAT CAT CTT CCT CTA 
X£ Z JCALDESfCKDLAV 
7*1/251 

C CTC AAT CCC TTT TTA CAA CTC AAA CTA ACA CAC 
L N ? L Z V :< V T C 

511/271 

7 CCC CTT CAC TCT CAT CAA CAC TCC ACC CAA TCC 
CLDCCSMSTIS 
871/231 

CCC CTC ACC CTC CAT TTC CAA CCC TTT CCA TCC CAC TCC ATT ATT 
PLTVCFIArCWDWlI 
931/3 li 

AAC CCT AAT TAC TCC TCT CCA CAC TCT CAA TTT CTC TTC TTA CAA 
XAKYCSCECEFVfLQ 
991/321 

CAT CTT CTC CAC CAA CCA AAC CCC ACA CCC TCC CCA CCC CCT TCC 
>; l v :-: C A M ? ?. C S A C ? C 
ICS 1/3 5 1 

ATC TCT CCC ATT AAT ATC CTA TAT TTT AAT CCC AAA CAA CAA ATA 
S ? I S* M L Y F N C K £ C I 
1111/371 

CCA CCC ATC CTA CTA CAC CCC TCT CCC TCC TCC TCA 
?AMVVDRCCCS" 



CA CCA CAA CA" 



ACA CAC 
R D 
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F1U LIRE 14hc 
1/1 31/11 

A -C CAA AAC CTA CCA CTC TAT CT7 TAT ATT TAC CTC TTC ATC CAC ATT TTA CTT CAT CCC 

tf * c Xl-XVTVYITI.PKQlLVKP 

61/21 

cTC cr -7 CTT CAT CCC ACT ACT CAC CCC ACA CAC AAC CCT CAA AAA CAC CCA CTC TCC AAT 
v'aLECSSQ 9T&KASXDCLCN 
121/U 151/51 

*11 TCC xCC TCC ACA C\C AAT ACT AAA TCC TCC ACA ATA CAA CTC ATA AAA ATT CAA ATC 

a w *^7'->cnt?:ssr;iaik:c: 
iii/c: 311/11 

AC - AAA CTC CCC CTC CAA CAA CCA CTT AAC ATT ACC ACC CAC CTT ATT AAA CAA CTT 

t: l' a l z q a p n I s r d v I a o l 
an/si 271/91 

c — CCT CTT CCC CTC CAC CAA CTC ATT CAT CAC TAT CAC CTC CAC ACA CAC CAC 

C*"?~"xA??LC ZLZZQXOVQIZ 0 

3o:/::: 33i/m 

AC - ACC CAT CCC TCT TTC CAA CAC CAT CAC TAT CAT CCC ACA ACC CAA ACC ATT ATC ACA 
S W " S ~DGSLZ D ODTKATTETi: T 
-» 5V12 l 351/131 

zz - xiZ CAC TCT CAT TTT CTT CTA OA ATC CAC CCA AAA CCA AAA TCT TCC TTC TTT 
w*" ?~ - Z S 2 ? LVQX£CJC?JtCC-J 

V21/H1 * 4S1/1SI 

=• ACC TCT AAA ATA CAA TAT AAC AAA CTA CTA AAC CCA CAA TTA TCC ATA TAC TTC 

x " r"" S S X IQ V NXVVR A Q tW IY L 

<ai/ls: sii/171 

x — C' V CTC CU AAA CTT AC\ ACC CTC TTT CTC CAC ATC CTC ACA CTC ATT AAA CCC ATC 

t~ - -/ - ■ s ? t T V ' ? V Q C X L : X ? - K 

Sil/Iil £71/151 

AAA CAC CCT ACA ACA TAT ACT CCA ATT -CCA TCT TTC AAA CTT CAC ATC AAC CCA CCC ACT 
XCCT?-TT C :aSLXLCKN?CT 
M1/2C1 S21/211 

c — x - c TCC CAC ACT ATT CAT CTC AAC ACA CTC TTC CAA AAT TCC CTC AAA CAC CCT CAA 
c'lWQ S 2 2> V KTVLC S * R C ? 2 

65-/""l 531/231 

— C AAT TTA CCC ATC CAA ATA AAA CCT TTT CAT CAC AAT CCA CCA CAT CTT CCT CTA ACA 
1"* S * C'2:*Ac2SNGRDLAVT 
721/241 7S1/251 

— CCA CCA CC" CAA CAT CCA CTC AAC CCA TTT TTA CAC CTC ACA CTT ACA CA- A -A 

t c 7 c 1 r c l h ? i s v a v t c t 

Til/Ul 511/171 

V-\ CCC TCT CCC ACA CAT TTT CCC CTT CAT TCC CAC CAC CAC *-A ACC CAA TC. C-~ 
z - P. S ?. A C T CICCCIKSTIS?. 
S*I/2H 971/291 

T=T T: — c « taC CTC CTC ACA CTC CAT TTT CAA CCT TTT CCA TCC CAC TC^ A. . A.^ 
c"cRY?lTV CrZA ? C W S W 2 I A 
901/3C1 531/311 

CCT AAA ACA TAC AAA CCC AAT TAC TCC TCT CCA CAA TCT CAA TTC CTA TTT CTA CAC AAA 

? k*t7. AJircscscsrvFLCS 

951/221 991/331 

-AC CCC CAC ACT CAC CTC CTA CAC CAA CCA AAT CCA ACA CCC TC\ CCA CCC CCT TCC TCC 

Y " 7" X T H iv H T A ? R C S A C ? C C 

.* ... 12 5 1/2 Si 

ACa'cCc'aCC AAC ATC TCC C2T ATA AAC ATC CTC TAT TTC AAT CCA AAA CAA CAA ATA A 

t ? t k r. s p 1 N m 1 r t y c a t c : : 
;:2i/2s: i:ii/3t: _ 

TAT CCA AAC ATA CCA CCC ATC CTT CTA CAT CCT TCC CCC TCC . — * TCA 

rex; ? a m v v crcccs 
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CTC CAA ATC TAT BgT TAT ATT TAC CTC TTT ATG CTC ATT CTT CCT CC 



ATC CAA AAA CTC CAA ATC TAT ACT TAT ATT TAC C1^> J IT ATG CTC ATT CTT CCT CCC 
M O K 

«l/2l ^ 91/31 

CTC CAT CTC AAT CAC AAC ACC CAC CAA AAC CAA AAT CTC CAA AAA CAC CCC CTC TCT AAT 
V 0 L N E N S £ Q K E M V £ K £ C L 'C N 
121/<1 1S1/S1 

CCA TCT ATC TCC ACA CAA AAC ACT AAA TCT TCA ACA CTA CAA CCC ATA AAA ATT CAA ATC 
ACMW30NTKSSRLEAIKIQI 
181/61 211/71 

CTC ACT AAA CTT ZZZ CTC CAA ACA CCT CCT AAC ATT ACC AAA CAT CCT ATA ACA CAA CTT 

LSKLSLETAPNISKD-AIRQL 
241/81 271/91 

TTC CCC AAA CCT CCT CCA CTC CCC CAA CTC ATT CAT CAC TAC CAT CTC CAC ACA CAT CAC 

LPKArPLRELlDQYDVQRDD 
301/101 331/111 

ACC ACT CAT CCC TCC TTC CAA CAT CAT CAT TAT CAC CCT ACC ACC CAA ACC ATC ATT ACC 
SSOCSLED DDYHATTETI IT 
361/121 391/131 

ATC CCT ACA CAC TCT CAT CTT CTA ATC CAA CTC CAA CCA AAA CCC AAA TCC TCC TTC TTT 

MPTESOLLMQVECKPKCCFF 
<21/141 4S1/151 

AAA TTT ACC TCT AAA ATA CAA TAC AAT AAA CTA CTA AAC CCC CAA CTC TCC ATA TAT CTC 
KFSSXIQYNKVVKACLWIYl 
431/161 £11/171 

ACA CCC CTC AAC ACT CCT ACA ACA CTC TTT CTC CAA ATC CTC ACA CTC ATC AAA CCC ATC 
?. PVKTPTTVfVQILRLIiCPM 
£41/181 S71/191 

AAA CAC CCT ACA ACC TAT ACT CCA ATC CCA TCT CTC AAA CTT CAC ATC AAC CCA CCC ACT 
XOCTRYTCIRSLKLDMKPCT 
601/201 631/211 

cct att tcc cac acc att cat ctc aac aca ctc ttc caa aat tcc ctc aaa caa cct caa 
c:wqs:dvktvlqnwlxcpe 

661/221 c?l/231 

TCC AAC TTA CCC ATT CAA ATC AAA CCT TTA CAT CAC AAT CCT CAT CAT CTT CCT CTA ACC 

s l c : z i x a l d e n c k d l a v t 

7 : 1/2 41 751/251 

TTC CCA CCA CCA CCA CAA CAT CCC CTC AAT CCC TTT TTA CAA CTC AAC CTA ACA CAC ACA 
FPCrGEDC LMPrLEVKVTDT 
791/251 811/271 

CCA AAA ACA TTC ACC ACA CAT TTT CCA CTC CAC TCT CAT CAC CAC TCA ACA CAA TCT CCA 
?XRS?. ROFCLOCOEKSTESR 
341/2ai 871/291 

- - - - -> - .A. — . C.A ACT CTC . TTT CAA CCT TTT CCA TCC CAC TCC ATT ATT CCA 

CCRYrLTVOFEAFCWOWI I A 
901/301 * 931/311 

;.:A 7AT AAC CCC AAT TAC TCC TCT CCA CAC TCT CAA TTT CTA TTT TTA CAA AAA 

? K AYXANY C scecz fvflqk 
351/321 991/331 

TAC CCT CAC ACT CAT CTT CTC CAC CAA CCA AAC CCC ACA CCT TCA CCA CCC CCC TCC TCT 

? K T a L V K Q A K F a - G S A C p C C 
1C21/3«1 10S1/3S1 

ACT CCC ACA AAC ATC TCT CCA ATC AAT ATC CTA TAT TTT AAT CCC AAA CAA CAA ATA ATA 
TrTXM SPINKLYFNCXEQII 
1C31/351 1111/371 

TAT CCC AAA ATT CCA CCC ATC CTA CTA CAT CCC TCT CCC TCC TCA TCA 
r c k : PAMVVDRCCCS* 

Porcine CDF-5 
FIGURE U5 
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X7C CAA K\JK CTC CAA ATC TTT CTT TAT ATT TAC CTA TTT ATC CTG CTT CTT CCT CCC CCA 

wj'q KLC * F V T IY LFMLLVACP 
61/21 91/31 
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CCA TCC TTC TCC ACA C.-A AAC AAT AAA TCT TCA ACA CTA CAA CCC ATA AAA ATC CAA ATC 

k ' C L V * C W ^ K S S R L E A I K I C I 

•31/51 

CTC ACT AAC CTT CCC CTC CAA ACA CCT CCT AAC ATC ACC AAA CAT CCT ATA ACA CAA CTT 

r* W 5 K L = - E T A ? N I S K C A I R C L 

:r./ai 27i/9i 

— z zzz p - - c -- «- - — CTC CCC CAA CTC ATT CAT CAC TAC GAT CTC CAC ACA CAT CAC 
• * W =" X A r~ .= - ^ £ L Z DC Y C V Q a 0 0 

J:i/ici 221/111 

ACC \CC G-C CCC TTT TTC CAA CAC CAT CAC TAC CAC CTT ACC ACC CAA ACC CTC ATT ACC 
s " CSLECC DYKVTTETVIT 

251/121 391/131 

V~C CCC ACC CAC TCT GAT CTT CTA CCA CAA CTC CAA CAA AAA CCC AAA TCT TCC TTC TTT 
v"?"t£SC L LA EVQEK ? X C C F F 
V; ;/i4i 4S1/1S1 

: 1^ ~ ACC TCT AAC ATA CAA CAC AAT AAA CTA CTA AAG CCC CAA CTC TCC ATA TAT CTG 

x ~ ? s s x : c h m x v v :< a q l w : y l 

4 31/151 511/171 

ACA CCT CTC AAG ACT CCT ACA ACA CTC TTT CTC CAA ATC CTC ACA CTC ATC AAA CCC ATG 

s ? V 3C 7 ? T T. VFVC1LRLIXPK 
=41/131 " 571/191 

ill CAC CCT ATA ACC TAT A IT CCA ATC CCA TTT CTC AAA CTT CAC ATC AAC CTA CCC ACT 

- C T ?. Y T C I a S L X L Z N ? C T 

$:i/:ci c3i/2ii 

GC-* \— ~ZZ C\C ACC ATT CAT CTC AAC ACA CTG TTC CAA AAC TCC CTC AAA CAA CCT CAA 
c "-WQSlOV.K7VtCMWLXQ?E 
6al/::i 691/231 

~C AAC TTA CCC ATT CAA ATC AAA CCT TTA CAT GAG AAT CCT CAT CAT CTT CCT CTA ACC 

s"" 2i i c ■ i e : x a l o z m c k z l a v t 

7'-/-4* 751/2E1 

— - CCA CAA CCA CCA CAA GAA CCA CTC AAT CCT TTT TTA CAA CTC AAC CTA ACA CAC ACA 
r 3 T0 C ££GLM?fLEVKVT0T 
731/251 811/271 

c -* 1 1 \ }z- ACC ACA CAT TTT CCC CTT CAT TCT CAT CAC CAT TCC ACA CAA TCT CCA 

s " s. S ?. R C 5 C L C C C E H S T E S R 

541/231 871/291 _ _ 

TCC TCT CCT TAC CCT CTA ACT CTC CAT TTT CAA G^ . TTT CCA TCC CAT TCC A* * A*. CCA 
C c ?. V r L T V D F E A F C W C W I I A 
921/3C1 931/311 

CCT AAA ACA TAT AAG CCC AAT TAC TCC TCT CCA CAA TCT CAA TTT TTA TTT TTC CAA AAC 
^r^vv^NYcsGECEF Lr LQK 
Sol/321 " 991/331 

-\T CCT CAT ACC CAT CTT CTC CAC CAA CCA AAC CCC AAA CCT TCA CCC CCC CtT TCC TCT 
•✓pKTKLVKCAtfPKCSACPCC 
1C 21/2 41 1031/3S1 

-IT CCT A.CA AAC ATC TCT CCA ATT AAT ATC CTA TAT TTT AAT CCC AAA CAA OA ATA ATA 
- ? T~ x" MS? 1 N M S- Y F N C K E C Z I 

i:a°./3?: 1111/371 

TAT CCC AAC ATT CTA CCC ATC CTA CTA CAT CCC TCT CCC TCC TCA TCA 
C v i ? c M V V C R C C C S 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 
(i) APPLICANT: Se-Jin Lee et al . , 

(ii) TITLE OF THE INVENTION: GROWTH DIFFERENTIATION FACTOR- 8 

(iii) NUMBER OF SEQUENCES: 27 

( iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: Fish & Richardson P.C. 

(B) STREET: 4225 Executive Square, Suite 1400 
<C) CITY: La Jolla 

(D) STATE: CA 

(E) COUNTRY: US 

(F) ZIP: 92037 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 9 5 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 05 -February- 1999 

( C ) CLASS IFICATION : 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/019 , 070 

(B) FILING DATE: 05 -February 1999 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/124 , 180 

(B) FILING DATE:28-July-1998 

(viii) ATTORNEY/ AGENT INFORMATION : 

(A) NAME: Lisa A. Haile, Ph.D. 

(B) REGISTRATION NUMBER: 38,347 

(C) REFERENCE/DOCKET NUMBER: 07265/154WO1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619/678-5070 

(B) TELEFAX: 619/678-5099 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: SJL141 

(ix) FEATURE: 

(A) NAME /KEY : Modified Base 

(B) LOCATION: 1...35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
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CCGGAATTCG GNTGGVANRA YTGGRTNRTN NKCNCC 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE : Genomic DNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: SJL147 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. . .33 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
CCGGAATTCR CANSCRCARC TNTCNACNRY CAT 33 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: ACM 13 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1...32 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CGCGGATC C A GAAGTCAAGG TGACAGACAC AC 32 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: ACM 14 

(ix) FEATURE: 

(A) NAME /KEY : CDS 
<B) LOCATION: 1 . . .33 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



WO 99/401i>. 



PCT/US99/0251 1 



-3- 

CGCGGATCCT CCTCATGAGC ACCCACAGCG GTC 33 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: mouse GDF-8 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 59... 436 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

TTAAGGTAGG AAGGATTTCA GGCTCTATTT ACATAATTGT TCTTTCCTTT TCACACAG 58 

AAT CCC TTT TTA GAA GTC AAG GTG ACA GAC ACA CCC AAG AGG TCC CGG 106 
Asn Pro Phe Leu Glu Val Lys Val Thr Asp Thr Pro Lys Arg Ser Arg 
15 10 15 

AGA GAC TTT GGG CTT GAC TGC GAT GAG CAC TCC ACG GAA TCC CGG TGC 154 
Arg Asp Phe Gly Leu Asp Cys Asp Glu His Ser Thr Glu Ser Arg Cys 
20 25 30 

TGC CGC TAC CCC CTC ACG GTC GAT TTT GAA GCC TTT GGA TGG GAC TGG 2 02 
Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu Ala Phe Gly Trp Asp Trp 
35 40 45 

ATT ATC GCA CCC AAA AGA TAT AAG GCC AAT TAC TGC TCA GGA GAG TGT 2 50 

lie lie Ala Pro Lys Arg Tyr Lys Ala Asn Tyr Cys Ser Gly Glu Cys 
50 55 60 

GAA TTT GTG TTT TTA CAA AAA TAT CCG CAT ACT CAT CTT GTG CAC CAA 2 98 
Glu Phe Val Phe Leu Gin Lys Tyr Pro His Thr His Leu Val His Gin 
65 70 75 80 

GCA AAC CCC AGA GGC TCA GCA GGC CCT TGC TGC ACT CCG ACA AAA ATG 346 
Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys Cys Thr Pro Thr Lys Met 

85 90 95 

TCT CCC ATT AAT ATG CTA TAT TTT AAT GGC AAA GAA CAA ATA ATA TAT 394 
Ser Pro lie Asn Met Leu Tyr Phe Asn Gly Lys Glu Gin lie lie Tyr 
100 105 110 

GGG AAA ATT CCA GCC ATG GTA GTA GAC CGC TGT GGG TGC TCA TGAGCTTTGC 446 
Gly Lys lie Pro Ala Met Val Val Asp Arg Cys Gly Cys Ser 
115 120 125 

ATTAGGTTAG AAACTTCCCA AGTCATGGAA GGTCTTCCCC TCAATTTCGA AACTGTGAAT 506 
TCCTGCAGCC CGGGGGATCC ACTAGTTCTA GAGCGGCCGC CACC 55 0 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
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(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 6 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : S ingle 

( D ) TOPOLOGY : 1 inear 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: human GDF-8 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 3... 326 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CA AAA AGA TCC AGA AGG GAT TTT GGT CTT GAC TGT GAT GAG CAC TCA 47 
Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His Ser 
15 10 15 

ACA GAA TCA CGA TGC TGT CGT TAC CCT CTA ACT GTG GAT TTT GAA GCT 95 
Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu Ala 
20 25 30 

TTT GGA TGG GAT TGG ATT ATC GCT CCT AAA AGA TAT AAG GCC AAT TAC 143 
Phe Gly Trp Asp Trp He He Ala Pro Lys Arg Tyr Lys Ala Asn Tyr 
35 40 45 

TGC TCT GGA GAG TGT GAA TTT GTA TTT TTA CAA AAA TAT CCT CAT ACT 191 
Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys Tyr Pro His Thr 
50 55 60 

CAT CTG GTA CAC CAA GCA AAC CCC AGA GGT TCA GCA GGC CCT TGC TGT 23 9 

His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys Cys 
65 70 75 
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ACT CCC ACA AAG ATG TCT CCA ATT AAT ATG CTA TAT TTT AAT GGC AAA 287 
Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu Tyr Phe Asn Gly Lys 
80 85 90 95 

GAA CAA ATA ATA TAT GGG AAA ATT CCA GCG ATG GTA GTA 326 
Glu Gin lie lie Tyr Gly Lys lie Pro Ala Met Val Val 

100 105 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Lys 


Arg 


Ser 


Arg 


Arg 


Asp 


Phe 


Gly 


Leu 


Asp 


Cys 


Asp 


Glu 


His 


Ser 


Thr 


1 








5 










10 










15 




Glu 


Ser 


Arg 


Cys 
20 


Cys 


Arg 


Tyr 


Pro 


Leu 
25 


Thr 


Val 


Asp 


Phe 


Glu 
30 


Ala 


Phe 


Gly 


Trp 


Asp 
35 


Trp 


lie 


He 


Ala 


Pro 
40 


Lys 


Arg 


Tyr 


Lys 


Ala 
45 


Asn 


Tyr 


Cys 


Ser 


Gly 
50 


Glu 


Cys 


Glu 


Phe 


Val 
55 


Phe 


Leu 


Gin 


Lys 


Tyr 
60 


Pro 


His 


Thr 


His 


Leu 


Val 


His 


Gin 


Ala 


Asn 


Pro 


Arg 


Gly 


Ser 


Ala 


Gly 


Pro 


Cys 


Cys 


Thr 


65 










70 










75 










80 


Pro 


Thr 


Lys 


Met 


Ser 


Pro 


He 


Asn 


Met 


Leu 


Tyr 


Phe 


Asn 


Gly 


Lys 


Glu 








85 










90 










95 




Gin 


lie 


lie 


Tyr 
100 


Gly 


Lys 


He 


Pro 


Ala 
105 


Met 


Val 


Val 











(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: SJL141 

<ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1. . .9 

(D) OTHER INFORMATION: "Xaa at position 3 = His, Gin, Asn, Lys, 
Asp, or Glu; Xaa at position 4 = Asp or Asn; Xaa at positions 6 and 7 is 
Val, He, or Met; Xaa at position 8 = Ala or Ser. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 

Gly Trp Xaa Xaa Trp Xaa Xaa Xaa Pro 
1 5 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 



BNSDOCID: < WO 9940 1 6 1 A 1 J _> 



WO 99/40181 



PCTVUS99/0251 1 



(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: SJL147 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 ... 8 

(D) OTHER INFORMATION: "Xaa at position 2 = Val, lie, Met, Thr or 
Ala; Xaa at position 4 = Asp or Glu; Xaa at position 7 = Gly, or Ala. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Xaa Val Xaa Ser Cys Xaa Cys 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 67 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 

<B) CLONE: Murine GDF-8 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1...2676 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GTCTCTCGGA CGGTACATGC ACTAATATTT CACTTGGCAT TACTCAAAAG CAAAAAGAAG 60 
AAATAAGAAC AAGGGAAAAA AAAAGATTGT GCTGATTTTT AAA ATG ATG CAA AAA 115 



1 



5 



Met Met Gin Lys 
1 



CTG CAA ATG TAT GTT TAT ATT TAC CTG TTC ATG CTG ATT GCT GCT GGC 
Leu Gin Met Tyr Val Tyr lie Tyr Leu Phe Met Leu lie Ala Ala Gly 
5 10 15 20 



163 



CCA GTG GAT CTA AAT GAG GGC AGT GAG AGA GAA GAA AAT GTG GAA AAA 
Pro Val Asp Leu Asn Glu Gly Ser Glu Arg Glu Glu Asn Val Glu Lys 
25 30 35 



211 



GAG GGG CTG TGT AAT GCA TGT GCG TGG AGA CAA AAC ACG AGG TAC TCC 
Glu Gly Leu Cys Asn Ala Cys Ala Trp Arg Gin Asn Thr Arg Tyr Ser 
40 45 50 



259 



AGA ATA GAA GCC ATA AAA ATT CAA ATC CTC AGT AAG CTG CGC CTG GAA 
Arg lie Glu Ala lie Lys lie Gin lie Leu Ser Lys Leu Arg Leu Glu 
55 60 65 



307 



ACA GCT CCT AAC ATC AGC AAA GAT GCT ATA AGA CAA CTT CTG CCA AGA 
Thr Ala Pro Asn lie Ser Lys Asp Ala lie Arg Gin Leu Leu Pro Arg 
70 75 80 



355 
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GCG CCT CCA CTC CGG GAA CTG ATC GAT CAG TAC GAC GTC CAG AGG GAT 403 

Ala Pro Pro Leu Arg Glu Leu lie Asp Gin Tyr Asp Val Gin Arg Asp 

85 90 95 100 

GAC AGC AGT GAT GGC TCT TTG GAA GAT GAC GAT TAT CAC GCT ACC ACG 4 51 

Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Thr Thr 
105 110 115 

GAA ACA ATC ATT ACC ATG CCT ACA GAG TCT GAC TTT CTA ATG CAA GCG 4 99 

Glu Thr lie lie Thr Met Pro Thr Glu Ser Asp Phe Leu Met Gin Ala 
120 125 130 

GAT GGC AAG CCC AAA TGT TGC TTT TTT AAA TTT AGC TCT AAA ATA CAG 54 7 

Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser Lys lie Gin 
135 140 145 

TAC AAC AAA GTA GTA AAA GCC CAA CTG TGG ATA TAT CTC AGA CCC GTC 595 
Tyr Asn Lys Val Val Lys Ala Gin Leu Trp He Tyr Leu Arg Pro Val 
150 155 160 

AAG ACT CCT ACA ACA GTG TTT GTG CAA ATC CTG AGA CTC ATC AAA CCC 643 
Lys Thr Pro Thr Thr Val Phe Val Gin He Leu Arg Leu He Lys Pro 
165 170 175 180 

ATG AAA GAC GGT ACA AGG TAT ACT GGA ATC CGA TCT CTG AAA CTT GAC 691 
Met Lys Asp Gly Thr Arg Tyr Thr Gly lie Arg Ser Leu Lys Leu Asp 
185 190 195 

ATG AGC CCA GGC ACT GGT ATT TGG CAG AGT ATT GAT GTG AAG ACA GTG 73 9 

Met Ser Pro Gly Thr Gly He Trp Gin Ser He Asp Val Lys Thr Val 
200 205 210 

TTG CAA AAT TGG CTC AAA CAG CCT GAA TCC AAC TTA GGC ATT GAA ATC 787 
Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly He Glu He 
215 220 225 

AAA GCT TTG GAT GAG AAT GGC CAT GAT CTT GCT GTA ACC TTC CCA GGA 835 
Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr Phe Pro Gly 
230 235 240 

CCA GGA GAA GAT GGG CTG AAT CCC TTT TTA GAA GTC AAG GTG ACA GAC 883 
Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys Val Thr Asp 
245 250 255 260 

ACA CCC AAG AGG TCC CGG AGA GAC TTT GGG CTT GAC TGC GAT GAG CAC 931 
Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His 
265 270 275 

TCC ACG GAA TCC CGG TGC TGC CGC TAC CCC CTC ACG GTC GAT TTT GAA 97 9 

Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu 
280 285 290 

GCC TTT GGA TGG GAC TGG ATT ATC GCA CCC AAA AGA TAT AAG GCC AAT 102 7 

Ala Phe Gly Trp Asp Trp He He Ala Pro Lys Arg Tyr Lys Ala Asn 
295 300 305 

TAC TGC TCA GGA GAG TGT GAA TTT GTG TTT TTA CAA AAA TAT CCG CAT 1075 
Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys Tyr Pro His 
310 315 320 

ACT CAT CTT GTG CAC CAA GCA AAC CCC AGA GGC TCA GCA GGC CCT TGC 1123 
Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys 
325 330 335 340 
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TGC ACT CCG AC A AAA ATG TCT CCC ATT AAT ATG CTA TAT TTT AAT GGC 1171 
Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu Tyr Phe Asn Gly 
345 350 355 

AAA GAA CAA ATA ATA TAT GGG AAA ATT CCA GCC ATG GTA GTA GAC CGC 1219 
Lys Glu Gin lie lie Tyr Gly Lys lie Pro Ala Met Val Val Asp Arg 
360 365 370 

TGT GGG TGC TCA TGAGCTTTGC ATTAGGTTAG AAACTTCCCA AGTCATGGAA GGTCT 12 76 
Cys Gly Cys Ser 
375 

TCCCCTCAAT TTCGAAACTG TGAATTCAAG CACCACAGGC TGTAGGCCTT GAGTATGCTC 13 36 

TAGTAACGTA AGCACAAGCT ACAGTGTATG AACTAAAAGA GAGAATAGAT GCAATGGTTG 1396 

GCATTCAACC ACCAAAATAA ACCATACTAT AGGATGTTGT ATGATTTCCA GAGTTTTTGA 14 56 

AATAGATGGA GATCAAATTA CATTTATGTC CATATATGTA TATTACAACT ACAATCTAGG 1516 

CAAGGAAGTG AGAGCACATC TTGTGGTCTG CTGAGTTAGG AGGGTATGAT TAAAAGGTAA 1576 

AGTCTTATTT CCTAACAGTT TCACTTAATA TTTACAGAAG AATCTATATG TAGCCTTTGT 1636 

AAAGTGTAGG ATTGTTATCA TTTAAAAACA TCATGTACAC TTATATTTGT ATTGTATACT 1696 

TGGTAAGATA AAATTCCACA AAGTAGGAAT GGGGCCTCAC ATACACATTG CCATTCCTAT 1756 

TATAATTGGA CAATCCACCA CGGTGCTAAT GCAGTGCTGA ATGGCTCCTA CTGGACCTCT 1816 

CGATAGAACA CTCTACAAAG TACGAGTCTC TCTCTCCCTT CCAGGTGCAT CTCCACACAC 1876 

ACAGCACTAA GTGTTCAATG CATTTTCTTT AAGGAAAGAA GAATCTTTTT TTCTAGAGGT 1936 

CAACTTTCAG TCAACTCTAG CACAGCGGGA GTGACTGCTG CATCTTAAAA GGCAGCCAAA 1996 

CAGTATTCAT TTTTTAATCT AAATTTCAAA ATCACTGTCT GC CTTTATCA CATGGCAATT 2056 

TTGTGGTAAA ATAATGGAAA TGACTGGTTC TATCAATATT GTATAAAAGA CTCTGAAACA 2116 

ATTACATTTA TATAATATGT ATACAATATT GTTTTGTAAA TAAGTGTCTC CTTTTATATT 2176 

TACTTTGGTA TATTTTTACA CTAATGAAAT TTCAAATCAT TAAAGTACAA AGACATGTCA 2236 

TGTATCACAA AAAAGGTGAC TGCTTCTATT TCAGAGTGAA TTAGCAGATT CAATAGTGGT 22 96 

CTTAAAACTC TGTATGTTAA GATTAGAAGG TTATATTACA ATCAATTTAT GTATTTTTTA 2356 

CATTATCAAC TTATGGTTTC ATGGTGGCTG TATCTATGAA TGTGGCTCCC AGTCAAATTT 2416 

CAATGCCCCA CCATTTTAAA AATTACAAGC ATTACTAAAC ATACCAACAT GTATCTAAAG 2476 

AAATACAAAT ATGGTATCTC AATAACAGCT ACTTTTTTAT TTTATAATTT GACAATGAAT 2536 

ACATTTCTTT TATTTACTTC AGTTTTATAA ATTGGAACTT TGTTTATCAA ATGTATTGTA 25 96 

TCATAGCTA AATGAAATTA TTTCTTACAT AAAAATGTGT AGAAACTATA AATTAAAGTG 2656 

TTTTCACATT TTTGAAAGGC 2 676 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 76 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) . FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Met 


Met 


Gin 


Lys 


Leu 


Gin 


Met 


Tyr 


Val 


Tyr 


He 


Tyr 


Leu 


Phe 


Met 


Leu 


1 








5 










10 










15 




lie 


Ala 


Ala 


Gly 


Pro 


Val 


Asp 


Leu 


Asn 


Glu 


Gly 


Ser 


Glu 


Arg 


Glu 


Glu 








20 










25 










30 






Asn 


Val 


Glu 


Lys 


Glu 


Gly 


Leu 


Cys 


Asn 


Ala 


Cys 


Ala 


Trp 


Arg 


Gin 


Asn 






35 










40 










45 








Thr 


Arg 


Tyr 


Ser 


Arg 


lie 


Glu 


Ala 


He 


Lys 


He 


Gin 


He 


Leu 


Ser 


Lys 




50 










55 










60 










Leu 


Arg 


Leu 


Glu 


Thr 


Ala 


Pro 


Asn 


He 


Ser 


Lys 


Asp 


Ala 


He 


Arg 


Gin 


65 










70 










75 










80 


Leu 


Leu 


Pro 


Arg 


Ala 


Pro 


Pro 


Leu 


Arg 


Glu 


Leu 


He 


Asp 


Gin 


Tyr 


Asp 










85 










90 










95 




Val 


Gin 


Arg 


Asp 


Asp 


Ser 


Ser 


Asp 


Gly 


Ser 


Leu 


Glu 


Asp 


Asp 


Asp 


Tyr 








100 










105 










110 






His 


Ala 


Thr 


Thr 


Glu 


Thr 


He 


He 


Thr 


Met 


Pro 


Thr 


Glu 


Ser 


Asp 


Phe 



115 120 125 
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Leu Met Gin Ala Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser 

130 135 140 

Ser Lys He Gin Tyr Asn Lys Val Val Lys Ala Gin Leu Trp He Tyr 
145 150 155 160 

Leu Arg Pro Val Lys Thr Pro Thr Thr Val Phe Val Gin He Leu Arg 

165 170 1*75 

Leu He Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly He Arg Ser 

180 185 190 

Leu Lys Leu Asp Met Ser Pro Gly Thr Gly He Trp Gin Ser He Asp 

195 200 205 

Val Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu 

210 215 220 

Gly He Glu He Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val 
225 230 235 240 

Thr Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val 

245 250 255 

Lys Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp 

260 265 270 

Cys Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr 

275 280 285 

Val Asp Phe Glu Ala Phe Gly Trp Asp Trp He He Ala Pro Lys Arg 

290 295 300 

Tvr Lvs Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin 
305 310 315 320 

Lvs Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser 

325 330 335 

Ala Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro He Asn Met Leu 

340 345 350 

Tyr Phe Asn Gly Lys Glu Gin He He Tyr Gly Lys He Pro Ala Met 

355 360 365 

Val Val Asp Arg Cys Gly Cys Ser 
370 375 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2743 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Human GDF-8 

<ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1...2743 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

AAGAAAAGTA AAAGGAAGAA ACAAGAACAA GAAAAAAGAT TATATTGATT TTAAAATC 58 

ATG CAA AAA CTG CAA CTC TGT GTT TAT ATT TAC CTG TTT ATG CTG ATT 106 
Met Gin Lys Leu Gin Leu Cys Val Tyr He Tyr Leu Phe Met Leu He 
15 10 15 

GTT GCT GGT CCA GTG GAT CTA AAT GAG AAC AGT GAG CAA AAA GAA AAT 154 
Val Ala Gly Pro Val Asp Leu Asn Glu Asn Ser Glu Gin Lys Glu Asn 
20 25 30 
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GTG GAA AAA GAG GGG CTG TGT AAT GCA TGT ACT TGG AGA CAA AAC ACT 202 
Val Glu Lys Glu Gly Leu Cys Asn Ala Cys Thr Trp Arg Gin Asn Thr 
35 40 45 

AAA TCT TCA AGA ATA GAA GCC ATT AAG ATA CAA ATC CTC AGT AAA CTT 250 
Lys Ser Ser Arg lie Glu Ala lie Lys lie Gin lie Leu Ser Lys Leu 
50 55 60 

CGT CTG GAA ACA GCT CCT AAC ATC AGC AAA GAT GTT ATA AGA CAA CTT 2 98 

Arg Leu Glu Thr Ala Pro Asn lie Ser Lys Asp Val lie Arg Gin Leu 
65 70 75 80 

TTA CCC AAA GCT CCT CCA CTC CGG GAA CTG ATT GAT CAG TAT GAT GTC 34 6 

Leu Pro Lys Ala Pro Pro Leu Arg Glu Leu lie Asp Gin Tyr Asp Val 
85 90 95 

CAG AGG GAT GAC AGC AGC GAT GGC TCT TTG GAA GAT GAC GAT TAT CAC 3 94 

Gin Arg Asp Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His 
100 105 110 

GCT ACA ACG GAA ACA ATC ATT ACC ATG CCT ACA GAG TCT GAT TTT CTA 44 2 

Ala Thr Thr Glu Thr lie lie Thr Met Pro Thr Glu Ser Asp Phe Leu 
115 120 125 

ATG CAA GTG GAT GGA AAA CCC AAA TGT TGC TTC TTT AAA TTT AGC TCT 4 90 

Met Gin Val Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser 
130 135 140 

AAA ATA CAA TAC AAT AAA GTA GTA AAG GCC CAA CTA TGG ATA TAT TTG 53 8 

Lys lie Gin Tyr Asn Lys Val Val Lys Ala Gin Leu Trp lie Tyr Leu 
145 150 155 160 

AGA CCC GTC GAG ACT CCT ACA ACA GTG TTT GTG CAA ATC CTG AGA CTC 586 
Arg Pro Val Glu Thr Pro Thr Thr Val Phe Val Gin He Leu Arg Leu 
165 170 175 

ATC AAA CCT ATG AAA GAC GGT ACA AGG TAT ACT GGA ATC CGA TCT CTG 634 
He Lys Pro Met Lys Asp Gly Thr Arg Tyr Thr Gly He Arg Ser Leu 
180 185 190 

AAA CTT GAC ATG AAC CCA GGC ACT GGT ATT TGG CAG AGC ATT GAT GTG 682 
Lys Leu Asp Met Asn Pro Gly Thr Gly He Trp Gin Ser He Asp Val 
195 200 205 

AAG ACA GTG TTG CAA AAT TGG CTC AAA CAA CCT GAA TCC AAC TTA GGC 730 
Lys Thr Val Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly 
210 215 220 

ATT GAA ATA AAA GCT TTA GAT GAG AAT GGT CAT GAT CTT GCT GTA ACC 7 78 

He Glu He Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr 
225 230 235 240 

TTC CCA GGA CCA GGA GAA GAT GGG CTG AAT CCG TTT TTA GAG GTC AAG 82 6 

Phe Pro Gly Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys 
245 250 255 

GTA ACA GAC ACA CCA AAA AGA TCC AGA AGG GAT TTT GGT CTT GAC TGT 874 
Val Thr Asp Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys 
260 265 270 

GAT GAG CAC TCA ACA GAA TCA CGA TGC TGT CGT TAC CCT CTA ACT GTG 922 
Asp Glu His Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val 
275 280 285 
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GAT TTT GAA GCT TTT GGA TGG GAT TGG ATT ATC GCT CCT AAA AGA TAT 970 
Asp Phe Glu Ala Phe Gly Trp Asp Trp lie lie Ala Pro Lys Arg Tyr 
290 295 300 

AAG GCC AAT TAC TGC TCT GGA GAG TGT GAA TTT GTA TTT TTA CAA AAA 1018 
Lys Ala Asn Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys 
305 310 315 320 

TAT CCT CAT ACT CAT CTG GTA CAC CAA GCA AAC CCC AGA GGT TCA GCA 1066 
Tyr Pro His Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala 
325 330 335 

GGC CCT TGC TGT ACT CCC ACA AAG ATG TCT CCA ATT AAT ATG CTA TAT 1114 
Gly Pro Cys Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu Tyr 
340 345 350 

TTT AAT GGC AAA GAA CAA ATA ATA TAT GGG AAA ATT CCA GCG ATG GTA 1162 
Phe Asn Gly Lys Glu Gin lie lie Tyr Gly Lys lie Pro Ala Met Val 
355 360 365 

GTA GAC CGC TGT GGG TGC TCA TGAGATTTAT ATTAAGCGTT CATAACTTCC TAAAA 1219 
Val Asp Arg Cys Gly Cys Ser 
370 375 

ATGGAAGGTT TTCCCCTCAA CAATTTTGAA GCTGTGAAAT TAAGTACCAC AGGCTATAGG 127 9 

CCTAGAGTAT GCTACAGTCA CTTAAGCATA AGCTACAGTA TGTAAACTAA AAGGGGGAAT 133 9 

ATATGCAATG GTTGGCATTT AAC CATC CAA ACAAATCATA CAAGAAAGTT TTATGATTTC 1399 

CAGAGTTTTT GAGCTAGAAG GAGATCAAAT TACATTTATG TTCCTATATA TTACAACATC 1459 

GGCGAGGAAA TGAAAGCGAT TCTCCTTGAG TTCTGATGAA TTAAAGGAGT ATGCTTTAAA 1519 

GTCTATTTCT TTAAAGTTTT GTTTAATATT TACAGAAAAA TCCACATACA GTATTGGTAA 1579 

AATGCAGGAT TGTTATATAC CATCATTCGA ATCATCCTTA AACACTTGAA TTTATATTGT 1639 

ATGGTAGTAT ACTTGGTAAG ATAAAATTCC ACAAAAATAG GGATGGTGCA GCATATGCAA 1699 

TTTCCATTCC TATTATAATT GACACAGTAC ATTAACAATC CATGCCAACG GTGCTAATAC 1759 

GATAGGCTGA ATGTCTGAGG CTACCAGGTT TATCACATAA AAAACATTCA GTAAAATAGT 1819 

AAGTTTCTCT TTTCTTCAGG TGCATTTTCC TACACCTCCA AATGAGGAAT GGATTTTCTT 1879 

TAATGTAAGA AGAATCATTT TTCTAGAGGT TGGCTTTCAA TTCTGTAGCA TACTTGGAGA 1939 

AACTGCATTA TCTTAAAAGG CAGTCAAATG GTGTTTGTTT TTATCAAAAT GTCAAAATAA 1999 

CATACTTGGA GAAGTATGTA ATTTTGTCTT TGGAAAATTA CAACACTGCC TTTGCAACAC 2059 

TGCAGTTTTT ATGGTAAAAT AATAGAAATG ATCGACTCTA TCAATATTGT ATAAAAAGAC 2119 

TGAAACAATG CATTTATATA ATATGTATAC AATATTGTTT TGTAAATAAG TGTCTCCTTT 2179 

TTTATTTACT TTGGTATATT TTTACACTAA GGACATTTCA AATTAAGTAC TAAGGCACAA 223 9 

AGACATGTCA TG CATC AC AG AAAAGCAACT ACTTATATTT CAGAGCAAAT TAGCAGATTA 2299 

AATAGTGGTC TTAAAACTCC ATATGTTAAT GATTAGATGG TTATATTACA ATCATTTTAT 2359 

ATTTTTTTAC ATGATTAACA TTCACTTATG GATTCATGAT GGCTGTATAA AGTGAATTTG 2419 

AAATTTCAAT GGTTTACTGT CATTGTGTTT AAATCTCAAC GTTCCATTAT TTTAATACTT 2479 

GCAAAAACAT TACTAAGTAT ACCAAAATAA TTGACTCTAT TATCTGAAAT GAAGAATAAA 2539 

CTGATGCTAT CTCAACAATA ACTGTTACTT TTATTTTATA ATTTGATAAT GAATATATTT 2599 

CTGCATTTAT TTACTTCTGT TTTGTAAATT GGGATTTTGT TAATCAAATT TATTGTACTA 2659 

TGACTAAATG AAATTATTTC TTACATCTAA TTTGTAGAAA CAGTATAAGT TATATTAAAG 2719 

TGTTTTCACA TTTTTTTGAA AGAC 2743 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14: 
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Met 


Gin 


Lys 


Leu 


Gin 


Leu 


Cys 


Val 


Tyr 


He 


Tyr 


Leu 


Phe 


Met 


Leu 


He 


1 






5 










10 










15 




Val 


Ala 


Gly 


Pro 


Val 


Asp 


Leu 


Asn 


Glu 


Asn 


Ser 


Glu 


Gin 


Lys 


Glu 


Asn 






20 










25 










30 






Val 


Glu 


Lys 
35 


Glu 


Gly 


Leu 


Cys 


Asn 
40 


Ala 


Cys 


Thr 


Trp 


Arg 
45 


Gin 


Asn 


Thr 


Lys 


Ser 


Ser 


Arg 


He 


Glu 


Ala 


He 


Lys 


He 


Gin 


He 


Leu 


Ser 


Lys 


Leu 


50 










55 










60 










Arg 


Leu 


Glu 


Thr 


Ala 


Pro 


Asn 


He 


Ser 


Lys 


Asp 


Val 


He 


Arg 


Gin 


Leu 


65 










70 










75 










80 


Leu 


Pro 


Lys 


Ala 


Pro 
85 


Pro 


Leu 


Arg 


Glu 


Leu 
90 


He 


Asp 


Gin 


Tyr 


Asp 
95 


Val 


Gin 


Arg 


Asp 


Asp 
100 


Ser 


Ser 


Asp 


Gly 


Ser 
105 


Leu 


Glu 


Asp 


Asp 


Asp 
110 


Tyr 


His 


Ala 


Thr 


Thr 
115 


Glu 


Thr 


He 


He 


Thr 
120 


Met 


Pro 


Thr 


Glu 


Ser 
125 


Asp 


Phe 


Leu 


Met 


Gin 
130 


Val 


Asp 


Gly 


Lys 


Pro 
135 


Lys 


Cys 


Cys 


Phe 


Phe 
140 


Lys 


Phe 


Ser 


Ser 


Lys 


He 


Gin 


Tyr 


Asn 


Lys 


Val 


Val 


Lys 


Ala 


Gin 


Leu 


Trp 


He 


Tyr 


Leu 


145 










150 










155 










160 


Arg 


Pro 


Val 


Glu 


Thr 


Pro 


Thr 


Thr 


Val 


Phe 


Val 


Gin 


He 


Leu 


Arg 


Leu 








165 










170 










175 




He 


Lys 


Pro 


Met 
180 


Lys 


Asp 


Gly 


Thr 


Arg 
185 


Tyr 


Thr 


Gly 


He 


Arg 
190 


Ser 


Leu 


Lys 


Leu 


Asp 


Met 


Asn 


Pro 


Gly 


Thr 


Gly 


He 


Trp 


Gin 


Ser 


He 


Asp 


Val 




195 










200 










205 








Lys 


Thr 


Val 


Leu 


Gin 


Asn 


Trp 


Leu 


Lys 


Gin 


Pro 


Glu 


Ser 


Asn 


Leu 


Gly 


210 










215 










220 










He 


Glu 


He 


Lys 


Ala 


Leu 


Asp 


Glu 


Asn 


Gly His 


Asp 


Leu 


Ala 


Val 


Thr 


225 










230 










235 










240 


Phe 


Pro 


Gly 


Pro 


Gly 


Glu 


Asp 


Gly 


Leu 


Asn 


Pro 


Phe 


Leu 


Glu 


Val 


Lys 








245 










250 










255 




Val 


Thr 


Asp 


Thr 


Pro 


Lys 


Arg 


Ser 


Arg 


Arg Asp 


Phe 


Gly 


Leu 


Asp 


Cys 








260 










265 










270 






Asp 


Glu 


His 


Ser 


Thr 


Glu 


Ser 


Arg 


Cys 


C Y S 


Arg 


Tyr 


Pro 


Leu 


Thr 


Val 




275 










280 










285 








Asp 


Phe 
290 


Glu 


Ala 


Phe 


Gly 


Trp 
295 


Asp 


Trp 


He 


He 


Ala 
300 


Pro 


Lys 


Arg 


Tyr 


Lys 


Ala 


Asn 


Tyr 


Cys 


Ser Gly Glu Cys 


Glu 


Phe 


Val 


Phe 


Leu 


Gin 


Lys 


305 










310 










315 










320 


Tyr 


Pro 


His 


Thr 


His 


Leu 


Val 


His 


Gin 


Ala 


Asn 


Pro 


Arg 


Gly 


Ser 


Ala 








325 










330 










335 




Gly 


Pro 


Cys 


Cys 
340 


Thr 


Pro 


Thr 


Lys 


Met 
345 


Ser 


Pro 


He 


Asn 


Met 
350 


Leu 


Tyr 


Phe 


Asn Gly 


Lys 


Glu 


Gin 


He 


He 


Tyr 


Gly 


Lys 


He 


Pro 


Ala 


Met 


Val 






355 










360 










365 








Val 


Asp 
370 


Arg 


Cys 


Gly 


Cys 


Ser 
375 





















(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: #83 



WO 99/4016. 
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(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1 . . 34 

(C) OTHER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CGCGGATCCG TGGATCTAAA TGAGAACAGT GAGC 34 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: #84 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1. . .37 

(C) OTHER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CGCGAATTCT CAGGTAATGA TTGTTTCCGT TGTAGCG 37 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single- 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: #100 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. . .20 

(C) OTHER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AC AC TAAATC TTCAAGAATA 20 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1055 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vii) IMMEDIATE SOURCE: 
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(B) CLONE: Baboon GDF-8 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1...1055 
(D) OTHER INFORMATION: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 



ATG CAA AAA 9 
Met Gin Lys 
1 

CTG CAA CTC TGT GTT TAT ATT TAC CTG TTT ATG CTG ATT GTT GCT GGT 54 
Leu Gin Leu Cys Val Tyr lie Tyr Leu Phe Met Leu lie Val Ala Gly 
5 10 15 20 

CCA GTG GAT CTA AAT GAG AAC AGT GAG CAA AAA GAA AAT GTG GAA AAA 95 
Pro Val Asp Leu Asn Glu Asn Ser Glu Gin Lys Glu Asn Val Glu Lys 
25 30 35 

GAG GGG CTG TGT AAT GCA TGT ACT TGG AGA CAA AAC ACT AAA TCT TCA 14 0 

Glu Gly Leu Cys Asn Ala Cys Thr Trp Arg Gin Asn Thr Lys Ser Ser 
40 45 50 

AGA ATA GAA GCC ATT AAA ATA CAA ATC CTC AGT AAA CTT CGT CTG GAA 185 
Arg lie Glu Ala lie Lys lie Gin lie Leu Ser Lys Leu Arg Leu Glu 
55 60 65 

ACA GCT CCT AAC ATC AGC AAA GAT GCT ATA AGA CAA CTT TTA CCC AAA 230 
Thr Ala Pro Asn lie Ser Lys Asp Ala lie Arg Gin Leu Leu Pro Lys 
70 75 80 

GCG CCT CCA CTC CGG GAA CTG ATT GAT CAG TAT GAT GTC CAG AGG GAT 275 
Ala Pro Pro Leu Arg Glu Leu lie Asp Gin Tyr Asp Val Gin Arg Asp 
85 90 95 100 

GAC AGC AGC GAT GGC TCT TTG GAA GAT GAC GAT TAT CAC GCT ACA ACG 320 
Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Thr Thr 
105 110 115 

GAA ACA ATC ATT ACC ATG CCT ACA GAG TCT GAT TTT TTA ATG CAA GTG 365 
Glu Thr lie lie Thr Met Pro Thr Glu Ser Asp Phe Leu Met Gin Val 
120 125 130 

GAT GGA AAA CCC AAA TGT TGC TTC TTT AAA TTT .AGC TCT AAA ATA CAA 410 
Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser Lys lie Gin 
135 140 145 



TAC AAT AAA GTG GTA AAG GCC CAA CTA 
Tyr Asn Lys Val Val Lys Ala Gin Leu 
150 155 

GAG ACT CCT ACA ACA GTG TTT GTG CAA 
Glu Thr Pro Thr Thr Val Phe Val Gin 
165 170 

ATG AAA GAC GGT ACA AGG TAT ACT GGA 
Met Lys Asp Gly Thr Arg Tyr Thr Gly 
185 

ATG AAC CCA GGC ACT GGT ATT TGG CAG 
Met Ser Pro Gly Thr Gly lie Trp Gin 
200 205 



TGG ATA TAT TTG AGA CCC GTC 4 55 

Trp lie Tyr Leu Arg Pro Val 
160 

ATC CTG AGA CTC ATC AAA CCT 50 0 

lie Leu Arg Leu lie Lys Pro 
175 180 

ATC CGA TCT CTG AAA CTT GAC 54 5 

lie Arg Ser Leu Lys Leu Asp 
190 195 

AGC ATT GAT GTG AAG ACA GTG 590 
Ser lie Asp Val Lys Thr Val 
210 
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TTG CAA AAT TGG CTC AAA CAA CCT GAA TCC AAC TTA GGC ATT GAA ATA 635 
Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly He Glu He 
215 220 225 

AAA GCT TTA GAT GAG AAT GGT CAT GAT CTT GCT GTA ACC TTC CCA GGA 680 
Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr Phe Pro Gly 
230 235 240 

CCA GGA GAA GAT GGG CTG AAT CCC TTT TTA GAG GTC AAG GTA ACA GAC 725 
Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys Val Thr Asp 
245 250 255 260 

ACA CCC AAA AGA TCC AGA AGG GAT TTT GGT CTT GAC TGT GAT GAG CAC 770 
Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His 
265 270 275 

TCA ACA GAA TCG CGA TGC TGT CGT TAC CCT CTA ACT GTG GAT TTT GAA 815 
Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu 
280 285 290 

GCT CTT GGA TGG GAT TGG ATT ATC GCT CCT AAA AGA TAT AAG GCC AAT 860 
Ala Phe Gly Trp Asp Trp He He Ala Pro Lys Arg Tyr Lys Ala Asn 
295 300 305 

TAC TGC TCT GGA GAG TGT GAA TTT GTA TTT TTA CAA AAA TAT CCT CAT 905 
Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys Tyr Pro His 
310 315 320 

ACT CAT CTG GTA CAC CAA GCA AAC CCC AGA GGT TCA GCA GGC CCT TGC 950 
Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys 
325 330 335 340 

TGT ACT CCC ACA AAG ATG TCT CCA ATT AAT ATG CTA TAT TTT AAT GGC 995 
Cys Thr Pro Thr Lys Met Ser Pro He Asn Met Leu Tyr Phe Asn Gly 
345 350 355 

AAA GAA CAA ATA ATA TAT GGG AAA ATT CCA GCC ATG GTA GTA GAC CGC 1040 
Lys Glu Gin He He Tyr Gly Lys He Pro Ala Met Val Val Asp Arg 
360 365 370 

TGC GGG TGC TCA TGA 1055 
Cys Gly Cys Ser 
375 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 376 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Baboon GDF-8 

(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1...376 
(D) OTHER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Gin Lys 
1 



BNSDOCIO. <WO 99401B1A1 \_-> 



WO 99/40181 



PCT7US99/0251 1 



-16- 



Leu 


Gin 
5 


Leu 


Cys 


Val 


Tyr 


He 
10 


Tyr 


Leu 


Phe 


Met 


Leu 
15 


He 


Val 


Ala 


Gly 
20 


Pro 


Val 


Asp 


Leu 


Asn 


Glu 


Asn 


Ser 


Glu 


Gin 


Lys 


Glu 


Asn 


Val 


Glu 


Lys 








25 










30 










35 




Glu 


Gly 


Leu 


Cys 
40 


Asn 


Ala 


Cys 


Thr 


Trp 
45 


Arg 


Gin 


Asn 


Thr 


Lys 
50 


Ser 


Ser 


Arg 


He 


Glu 


Ala 


He 


Lys 


He 


Gin 


He 


Leu 


Ser 


Lys 


Leu 


Arg 


Leu 


Glu 




55 










60 










65 








Thr 


Ala 
70 


Pro 


Asn 


He 


Ser 


Lys 
75 


Asp 


Ala 


He 


Arg 


Gin 
80 


Leu 


Leu 


Pro 


Lys 


Ala 


Pro 


Pro 


Leu 


Arg 


Glu 


Leu 


He 


Asp 


Gin 


Tyr 


Asp 


Val 


Gin 


Arg 


Asp 


85 










90 










95 










100 


Asp 


Ser 


Ser 


Asp 


Gly 


Ser 


Leu 


Glu 


Asp 


Asp 


Asp 


Tyr 


His 


Ala 


Thr 


Thr 








105 










110 










115 




Glu 


Thr 


He 


He 


Thr 


Met 


Pro 


Thr 


Glu 


Ser Asp 


Phe 


Leu 


Met 


Gin 


Val 








120 










125 










130 






Asp 


Gly 


Lys 
135 


Pro 


Lys 


Cys 


Cys 


Phe 
140 


Phe 


Lys 


Phe 


Ser 


Ser 
145 


Lys 


He 


Gin 


Tyr 


Asn 


Lys 


Val 


Val 


Lys 


Ala 


Gin 


Leu 


Trp 


He 


Tyr 


Leu 


Arg 


Pro 


Val 


150 










155 










160 










Glu 


Thr 


Pro 


Thr 


Thr 


Val 


Phe 


Val 


Gin 


He 


Leu 


Arg 


Leu 


He 


Lys 


Pro 


165 










170 










175 










180 


Met 


Lys 


Asp 


Gly 


Thr 


Arg 


Tyr 


Thr 


Gly 


He 


Arg 


Ser 


Leu 


Lys 


Leu 


Asp 










185 








190 








195 


Met 


Ser 


Pro 


Gly 


Thr 


Gly 


He 


Trp 


Gin 


Ser 


He 


Asp 


Val 


Lys 


Thr 


Val 








200 








205 








210 




Leu 


Gin 


Asn 


Trp 


Leu 


Lys 


Gin 


Pro 


Glu 


Ser 


Asn 


Leu 


Gly 


He 


Glu 


He 






215 








220 










225 








Lys 


Ala 


Leu 


Asp 


Glu 


Asn 


Gly 


His 


Asp 


Leu 


Ala 


Val 


Thr 


Phe 


Pro 


Gly 


230 










235 










240 










Pro 


Gly Glu Asp 


Gly 


Leu 


Asn 


Pro 


Phe 


Leu 


Glu 


Val 


Lys 


Val 


Thr 


Asp 


245 










250 










255 










260 


Thr 


Pro 


Lys 


Arg 


Ser 
265 


Arg 


Arg 


Asp 


Phe 


Gly 
270 


Leu 


Asp 


Cys 


Asp 


Glu 

275 


His 


Ser 


Thr 


Glu 


Ser 
280 


Arg 


Cys 


Cys 


Arg 


Tyr 
285 


Pro 


Leu 


Thr 


Val 


Asp 
290 


Phe 


Glu 


Ala 


Phe 


Gly Trp 


Asp 


Trp 


He 


He 


Ala 


Pro 


Lys 


Arg 


Tyr 


Lys 


Ala 


Asn 






295 










300 










305 








Tyr 


Cys 
310 


Ser 


Gly 


Glu 


Cys 


Glu 
315 


Phe 


Val 


Phe 


Leu 


Gin 
320 


Lys 


Tyr 


Pro 


His 


Thr 


His 


Leu 


Val 


His 


Gin 


Ala 


Asn 


Pro 


Arg 


Gly 


Ser 


Ala 


Gly 


Pro 


Cys 


325 










330 










335 










340 


Cys 


Thr 


Pro 


Thr 


Lys 


Met 


Ser 


Pro 


He 


Asn 


Met 


Leu 


Tyr 


Phe 


Asn 


Gly 








345 










350 










355 




Lys 


Glu 


Gin 


He 


He 


Tyr 


Gly Lys 


He 


Pro 


Ala 


Met 


Val 


Val 


Asp 


Arg 








360 










365 










370 






Cys 


Gly Cys 


Ser 






























375 





























(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1055 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: genomic DNA 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Bovine GDF-8 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

<B) LOCATION: 1...1055 
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994QtFHAl 1 > 



WO 99/4018. 



PCT/US99/0251 1 



-17- 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATG CAA AAA 9 
Met Gin Lys 
1 

CTG CAA ATC TCT GTT TAT ATT TAC CTA TTT ATG CTG ATT GTT GCT GGC 54 
Leu Gin lie Ser Val Tyr lie Tyr Leu Phe Met Leu lie Val Ala Gly 
5 10 15 20 

CCA GTG GAT CTG AAT GAG AAC AGC GAG CAG AAG GAA AAT GTG GAA AAA 95 
Pro Val Asp Leu Asn Glu Asn Ser Glu Gin Lys Glu Asn Val Glu Lys 
25 30 35 

GAG GGG CTG TGT AAT GCA TGT TTG TGG AGG GAA AAC ACT ACA TCG TCA 14 0 

Glu Gly Leu Cys Asn Ala Cys Leu Trp Arg Glu Asn Thr Thr Ser Ser 
40 45 50 

AGA CTA GAA GCC ATA AAA ATC CAA ATC CTC AGT AAA CTT CGC CTG GAA 185 
Arg Leu Glu Ala He Lys He Gin He Leu Ser Lys Leu Arg Leu Glu 
55 60 65 

ACA GCT CCT AAC ATC AGC AAA GAT GCT ATC AGA CAA CTT TTG CCC AAG 230 
Thr Ala Pro Asn He Ser Lys Asp Ala He Arg Gin Leu Leu Pro Lys 
70 75 80 

GCT CCT CCA CTC CTG GAA CTG ATT GAT CAG TTC GAT GTC CAG AGA GAT 275 
Ala Pro Pro Leu Leu Glu Leu He Asp Gin Phe Asp Val Gin Arg Asp 
85 90 95 100 

GCC AGC AGT GAC GGC TCC TTG GAA GAC GAT GAC TAC CAC GCC AGG ACG 320 
Ala Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Arg Thr 
105 110 115 

GAA ACG GTC ATT ACC ATG CCC ACG GAG TCT GAT CTT CTA ACG CAA GTG 365 
Glu Thr Val He Thr Met Pro Thr Glu Ser Asp Leu Leu Thr Gin Val 
120 125 130 

GAA GGA AAA CCC AAA TGT TGC TTC TTT AAA TTT AGC TCT AAG ATA CAA 410 
Glu Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser Lys He Gin 
135 140 145 

TAC AAT AAA CTA GTA AAG GCC CAA CTG TGG ATA TAT CTG AGG CCT GTC 455 
Tyr Asn Lys Leu Val Lys Ala Gin Leu Trp He Tyr Leu Arg Pro Val 
150 155 160 

AAG ACT CCT GCG ACA GTG TTT GTG CAA ATC CTG AGA CTC ATC AAA CCC 50 0 

Glu Thr Pro Thr Ala Val Phe Val Gin He Leu Arg Leu He Lys Pro 
165 170 175 180 

ATG AAA GAC GGT ACA AGG TAT ACT GGA ATC CGA TCT CTG AAA CTT GAC 54 5 

Met Lys Asp Gly Thr Arg Tyr Thr Gly He Arg Ser Leu Lys Leu Asp 
185 . 190 195 

ATG AAC CCA GGC ACT GGT ATT TGG CAG AGC ATT GAT GTG AAG ACA GTG 590 
Met Ser Pro Gly Thr Gly He Trp Gin Ser He Asp Val Lys Thr Val 
200 205 210 

TTG CAG AAC TGG CTC AAA CAA CCT GAA TCC AAC TTA GGC ATT GAA ATC 635 
Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly He Glu He 
215 220 225 
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AAA GCT TTA GAT GAG AAT GGC CAT GAT CTT GCT GTA ACC TTC CCA GAA 680 
Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr Phe Pro Glu 
230 235 240 

CCA GGA GAA GAT GGA CTG ACT CCC TTT TTA GAA GTC AAG GTA ACA GAC 725 
Pro Gly Glu Asp Gly Leu Thr Pro Phe Leu Glu Val Lys Val Thr Asp 
245 250 255 260 

ACA CCA AAA AGA TCT AGG AGA GAT TTT GGG CTT GAT TGT GAT GAA CAC 77 0 

Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His 
265 270 275 

TCC ACA GAA TCT CGA TGC TGT CGT TAC CCT CTA ACT GTG GAT TTT GAA 815 
Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu . Thr Val Asp Phe Glu 
280 285 290 

GCT TTT GGA TGG GAT TGG ATT ATT GCA CCT AAA AGA TAT AAG GCC AAT 860 
Ala Phe Gly Trp Asp Trp lie lie Ala Pro Lys Arg Tyr Lys Ala Asn 
295 300 305 

TAC TGC TCT GGA GAA TGT GAA TTT GTA TTT TTG CAA AAG TAT CCT CAT 905 
Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys Tyr Pro His 
310 315 320 

ACC CAT CTT GTG CAC CAA GCA AAC CCC AGA GGT TCA GCC GGC CCC TGC 950 
Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys 
325 330 335 340 

TGT ACT CCT ACA AAG ATG TCT CCA ATT AAT ATG CTA TAT TTT AAT GGC 995 
Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu Tyr Phe Asn Gly 
345 350 355 

GAA GGA CAA ATA ATA TAC GGG AAG ATT CCA GCC ATG GTA GTA GAT CGC 104 0 

Lys Glu Gin lie lie Tyr Gly Lys lie Pro Ala Met Val Val Asp Arg 
360 365 370 

TGT GGG TGT TCA TGA 10 S 

Cys Gly Cys Ser 
375 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRI 



Leu 


Gin 


lie 


Ser 


Val 


Tyr 


He 


Tyr 


5 










10 






Pro 


Val 


Asp 


Leu 


Asn 


Glu 


Asn 


Ser 










25 








Glu 


Gly 


Leu 


Cys 


Asn 


Ala 


Cys 


Leu 








40 










Arg 


Leu 


Glu 


Ala 


He 


Lys 


He 


Gin 






55 










60 


Thr 


Ala 


Pro 


Asn 


He 


Ser 


Lys 


Asp 



70 75 



?: SEQ ID NO: 21: 

Met Gin Lys 
1 

Leu Phe Met Leu He Val Ala Gly 
15 20 
Glu Gin Lys Glu Asn Val Glu Lys 

30 35 
Trp Arg Glu Asn Thr Thr Ser Ser 

45 50 
He Leu Ser Lys Leu Arg Leu Glu 
65 

Ala He Arg Gin Leu Leu Pro Lys 
80 
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Ala Pro Pro Leu 
85 

Ala Ser Ser Asp 

Glu Thr Val lie 
120 

Glu Gly Lys Pro 
135 

Tyr Asn Lys Leu 
150 

Glu Thr Pro Thr 
165 

Met Lys Asp Gly 

Met Ser Pro Gly 
200 

Leu Gin Asn Trp 
215 

Lys Ala Leu Asp 
230 

Pro Gly Glu Asp 
245 

Thr Pro Lys Arg 

Ser Thr Glu Ser 
280 

Ala Phe Gly Trp 
295 

Tyr Cys Ser Gly 
310 

Thr His Leu Val 
325 

Cys Thr Pro Thr 

Lys Glu Gin lie 
360 

Cys Gly Cys Ser 
375 



Leu Glu Leu lie 
90 

Gly Ser Leu Glu 
105 

Thr Met Pro Thr 

Lys Cys Cys Phe 
140 

Val Lys Ala Gin 
155 

Ala Val Phe Val 
170 

Thr Arg Tyr Thr 
185 

Thr Gly lie Trp 

Leu Lys Gin Pro 
220 

Glu Asn Gly His 
235 

Gly Leu Thr Pro 
250 

Ser Arg Arg Asp 
265 

Arg Cys Cys Arg 

Asp Trp lie lie 
300 

Glu Cys Glu Phe 
315 

His Gin Ala Asn 
330 

Lys Met Ser Pro 
345 

lie Tyr Gly Lys 



Asp Gin Phe Asp 
95 

Asp Asp Asp Tyr 
110 

Glu Ser Asp Leu 
125 

Phe Lys Phe Ser 

Leu Trp lie Tyr 
160 

Gin lie Leu Arg 
175 

Gly lie Arg Ser 
190 

Gin Ser lie Asp 
205 

Glu Ser Asn Leu 

Asp Leu Ala Val 
240 

Phe Leu Glu Val 
255 

Phe Gly Leu Asp 
270 

Tyr Pro Leu Thr 
285 

Ala Pro Lys Arg 

Val Phe Leu Gin 
320 

Pro Arg Gly Ser 
335 

lie Asn Met Leu 
350 

lie Pro Ala Met 
365 



Val Gin Arg Asp 
100 

His Ala Arg Thr 
115 

Leu Thr Gin Val 
130 

Ser Lys lie Gin 
145 

Leu Arg Pro Val 

Leu lie Lys Pro 
180 

Leu Lys Leu Asp 
195 

Val Lys Thr Val 
21 

Gly lie Glu lie 
225 

Thr Phe Pro Glu 

Lys Val Thr Asp 
260 

Cys Asp Glu His 
275 

Val Asp Phe Glu 
290 

Tyr Lys Ala Asn 
305 

Lys Tyr Pro His 

Ala Gly Pro Cys 
340 

Tyr Phe Asn Gly 
355 

Val Val Asp Arg 
370 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1055 base pairs 

(B) TYPE: nucleic acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Chicken GDF-8 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1...1055 
(D) OTHER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

ATG CAA AAG 9 
Met Gin Lys 
1 

CTG GCA GTC TAT GTT TAT ATT TAC CTG TTC ATG CAG ATC GCG GTT GAT 54 
Leu Ala Val Tyr Val Tyr lie Tyr Leu Phe Met Gin lie Ala Val Asp 
5 10 15 20 
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CCG GTG GCT CTG GAT GGC AGT AGT CAG CCC ACA GAG AAC GCT GAA AAA 95 
Pro Val Ala Leu Asp Gly Ser Ser Glu Gin Lys Glu Asn Val Glu Lys 
25 30 35 

GAC GGA CTG TGC AAT GCT TGT ACG TGG AGA CAG AAT ACA AAA TCC TCC 14 0 

Glu Gly Leu Cys Asn Ala Cys Thr Trp Arg Gin Asn Thr Lys Ser Ser 
40 45 50 

AGA ATA GAA GCC ATA AAA ATT CAA ATC CTC AGC AAA CTG CGC CTG GAA 185 
Arg lie Glu Ala lie Lys lie Gin lie Leu Ser Lys Leu Arg Leu Glu 
55 60 65 

CAA GCA CCT AAC ATT AGC AGG GAC GTT ATT AAG CAG CTT TTA CCC AAA 230 
Gin Ala Pro Asn lie Ser Arg Asp Val lie Lys Gin Leu Leu Pro Lys 
70 75 80 

GCT CCT CCA CTG CAG GAA CTG ATT GAT CAG TAT GAT GTC CAG AGG GAC 275 
Ala Pro Pro Leu Gin Glu Leu lie Asp Gin Tyr Asp Val Gin Arg Asp 
85 90 95 100 

GAC AGT AGC GAT GGC TCT TTG GAA GAC GAT GAC TAT CAT GCC ACA ACC 320 
Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Thr Thr 
105 110 115 

GAG ACG ATT ATC ACA ATG CCT ACG GAG TCT GAT TTT CTT GTA CAA ATG 365 
Glu Thr lie lie Thr Met Pro Thr Glu Ser Asp Phe Leu Val Gin Met 
120 125 130 

GAG GGA AAA CCA AAA TGT TGC TTC TTT AAG TTT AGC TCT AAA ATA CAA 410 
Glu Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser Lys lie Gin 
135 140 145 

TAT AAC AAA GTA GTA AAG GCA CAA TTA TGG ATA TAC TTG AGG CAA GTC 455 
Tyr Asn Lys Val Val Lys Ala Gin Leu Trp lie Tyr Leu Arg Gin Val 
150 155 160 

CAA AAA CCT ACA ACG GTG TTT GTG CAG ATC CTG AGA CTC ATT AAG CCC 500 
Gin Lys Pro Thr Thr Val Phe Val Gin lie Leu Arg Leu lie Lys Pro 
165 170 175 180 

ATG AAA GAC GGT ACA AGA TAT ACT GGA ATT CGA TCT TTG AAA CTT GAC 545 
Met Lys Asp Gly Thr Arg Tyr Thr Gly lie Arg Ser Leu Lys Leu Asp 
185 190 195 

ATG AAC CCA GGC ACT GGT ATC TGG CAG AGT ATT GAT GTG AAG ACA GTG 590 
Met Ser Pro Gly Thr Gly lie Trp Gin Ser lie Asp Val Lys Thr Val 
200 205 210 

CTG CAA AAT TGG CTC AAA CAG CCT GAA TCC AAT TTA GGC ATC GAA ATA 635 
Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly lie Glu lie 
215 220 225 

AAA GCT TTT GAT GAG ACT GGA CGA GAT CTT GCT GTC ACA TTC CCA GGA 6 80 

Lys Ala Phe Asp Glu Thr Gly Arg Asp Leu Ala Val Thr Phe Pro Gly 
230 235 240 

CCA GGA GAA GAT GGA TTG AAC CCA TTT TTA GAG GTC AGA GTT ACA GAC 725 
Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Arg Val Thr Asp 
245 250 255 260 

ACA CCG AAA CGG TCC CGC AGA GAT TTT GGC CTT GAC TGT GAT GAG CAC 77 0 

Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His 
265 270 275 
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TCA ACG GAA TCC CGA TGT TGT CGC TAC CCG CTG ACA GTG GAT TTC GAA 815 
Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu 
280 285 290 

GCT TTT GGA TGG GAC TGG ATT ATA GCA CCT AAA AGA TAC AAA GCC AAT 860 
Ala Phe Gly Trp Asp Trp lie lie Ala Pro Lys Arg Tyr Lys Ala Asn 
295 300 305 

TAC TGC TCC GGA GAA TGC GAA TTT GTG TTT CTA CAG AAA TAC CCG CAC 905 
Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys Tyr Pro His 
310 315 320 

ACT CAC CTG GTA CAC CAA GCA AAT CCC AGA GGC TCA GCA GGC CCT TGC 950 
Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys 
325 330 335 340 

TGC ACA CCC ACC AAG ATG TCC CCT ATA AAC ATG CTG TAT TTC AAT GGA 995 
Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu Tyr Phe Asn Gly 
345 350 355 

AAA GAA CAA ATA ATA TAT GGA AAG ATA CCA GCC ATG GTT GTA GAT CGT 1040 
Lys Glu Gin lie lie Tyr Gly Lys lie Pro Ala Met Val Val Asp Arg 
360 365 370 

TGC GGG TGC TCA TGA 1055 
Cys Gly Cys Ser 
375 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 376 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(v) FRAGMENT TYPE: internal 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Chicken GDF-8 
(ix) FEATURE: 

(A) NAME/ KEY : Protein 

(B) LOCATION: 1 . . 376 
(D) OTHER: 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Met Gin Lys 
1 



Leu 


Ala 


Val 


Tyr 


Val 


Tyr 


He 


Tyr 


Leu 


Phe 


Met 


Gin 


He 


Ala 


Val 


Asp 


5 








10 










15 










20 


Pro 


Val 


Ala 


Leu 


Asp 


Gly 


Ser 


Ser 


Glu 


Gin 


Lys 


Glu 


Asn 


Val 


Glu 


Lys 










25 








30 










35 




Glu 


Gly Leu 


Cys 


Asn 


Ala 


Cys 


Thr 


Trp 


Arg 


Gin 


Asn 


Thr 


Lys 


Ser 


Ser 








40 










45 










50 




Glu 


Arg 


He 


Glu 


Ala 


He 


Lys 


He 


Gin 


He 


Leu 


Ser 


Lys 


Leu 


Arg 


Leu 




55 










60 










65 






Lys 


Gin 


Ala 
70 


Pro 


Asn 


He 


Ser 


Arg 
75 


Asp 


Val 


He 


Lys 


Gin 
80 


Leu 


Leu 


Pro 


Ala 


Pro 


Pro 


Leu 


Gin 


Glu 


Leu 


He 


Asp 


Gin 


Tyr 


Asp 


Val 


Gin 


Arg Asp 


85 










90 










95 










100 


Asp 


Ser 


Ser 


Asp 


Gly 


Ser 


Leu 


Glu 


Asp 


Asp 


Asp 


Tyr 


His 


Ala 


Thr 


Thr 






105 










110 










11 




Glu 


Thr 


He 


He 


Thr 


Met 


Pro 


Thr 


Glu 


Ser 


Asp 


Phe 


Leu 


Val 


Gin 


Met 



120 125 130 
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Glu 


Gly 


Lys 


Pro 






135 




Tyr 


Asn 


Lys 


Val 




150 






Gin 


Lys 


Pro 


Thr 


165 








Met 


Lvs 


Asp 


Gly 


Met 


Ser 


Pro 


Gly 








200 


Leu 


Gin 


Asn 


Trp 






215 




Lys 


Ala 


Phe 


Asp 




230 






Pro 


Gly 


Glu 


Asp 


245 








Thr 


Pro 


Lys 


Arg 


Ser 


Thr 


Glu 


Ser 








280 


Ala 


Phe 


Gly 


Trp 






295 




Tyr 


Cys 


Ser 


Gly 




310 






Thr 


His 


Leu 


Val 


325 








Cys 


Thr 


Pro 


Thr 


Lys 


Glu 


Gin 


He 






360 


Cys 


Gly 


Cys 


Ser 



375 



Lys Cys Cys Phe 
140 

Val Lys Ala Gin 
155 

Thr Val Phe Val 
170 

Thr Arg Tyr Thr 
185 

Thr Gly He Trp 

Leu Lys Gin Pro 
220 

Glu Thr Gly Arg 
235 

Gly Leu Asn Pro 
250 

Ser Arg Arg Asp 
265 

Arg Cys Cys Arg 

Asp Trp He He 
300 

Glu Cys Glu Phe 
315 

His Gin Ala Asn 
330 

Lys Met Ser Pro 
345 

He Tyr Gly Lys 



Phe Lys Phe Ser 

Leu Trp He Tyr 
160 

Gin He Leu Arg 
175 

Gly He Arg Ser 
190 

Gin Ser He Asp 
205 

Glu Ser Asn Leu 

Asp Leu Ala Val 
240 

Phe Leu Glu Val 
255 

Phe Gly Leu Asp 
270 

Tyr Pro Leu Thr 
285 

Ala Pro Lys Arg 

Val Phe Leu Gin 
320 

Pro Arg Gly Ser 
335 

He Asn Met Leu 
350 

He Pro Ala Met 
365 



Ser Lys He Gin 
145 

Leu Arg Gin Val 

Leu He Lys Pro 
180 

Leu Lys Leu Asp 
195 

Val Lys Thr Val 
210 

Gly He Glu He 
225 

Thr Phe Pro Gly 

Arg Val Thr Asp 
260 

Cys Asp Glu His 
275 

Val Asp Phe Glu 
290 

Tyr Lys Ala Asn 
305 

Lys Tyr Pro His 

Ala Gly Pro Cys 
340 

Tyr Phe Asn Gly 
355 

Val Val Asp Arg 
370 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1276 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: Rat GDF-8 

( ix) FEATURE : 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1 . . . 1276 
(D) OTHER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:24: 



ATG ATT CAA AAA 115 
Met He Gin Lys 
1 

CCG CAA ATG TAT GTT TAT ATT TAC CTG TTT GTG CTG ATT GCT GCT GGC 163 
Pro Gin Met Tyr Val Tyr He Tyr Leu Phe Val Leu He Ala Ala Gly 
5 10 15 20 

CCA GTG GAT CTA AAT GAG GAC AGT GAG AGA GAG GCG AAT GTG GAA AAA 211 
Pro Val Asp Leu Asn Glu Asp Ser Glu Arg Glu Ala Asn Val Glu Lys 
25 30 35 
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GAG GGG CTG TGT AAT GCG TGT GCG TGG AGA CAA AAC ACA AGG TAC TCC 259 
Glu Gly Leu Cys Asn Ala Cys Ala Trp Arg Gin Asn Thr Arg Tyr Ser 
40 45 50 

AGA ATA GAA GCC ATA AAA ATT CAA ATC CTC AGT AAA CTC CGC CTG GAA 307 
Arg He Glu Ala He Lys He Gin He Leu Ser Lys Leu Arg Leu Glu 
55 60 €5 

ACA GCG CCT AAC ATC AGC AAA GAT GCT ATA AGA CAA CTT CTG CCC AGA 355 
Thr Ala Pro Asn He Ser Lys Asp Ala He Arg Gin Leu Leu Pro Arg 
70 75 80 

GCG CCT CCA CTC CGG GAA CTG ATC GAT CAG TAC GAC GTC CAG AGG GAT 403 
Ala Pro Pro Leu Arg Glu Leu He Asp Gin Tyr Asp Val Gin Arg Asp 
85 90 95 100 

GAC AGC AGT GAC GGC TCT TTG GAA GAT GAC GAT TAT CAC GCT ACC ACG 451 
Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Thr Thr 
105 HO 115 

GAA ACA ATC ATT ACC ATG CCT ACC GAG TCT GAC TTT CTA ATG CAA GCG 499 
Glu Thr He He Thr Met Pro Thr Glu Ser Asp Phe Leu Met Gin Ala 
120 125 130 

GAT GGA AAG CCC AAA TGT TGC TTT TTT AAA TTT AGC TCT AAA ATA CAG 547 
Asp Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser Lys He Gin 
135 140 145 

TAC AAC AAA GTG GTA AAG GCC CAG CTG TGG ATA TAT CTG AGA GCC GTC 595 
Tyr Asn Lys Val Val Lys Ala Gin Leu Trp He Tyr Leu Arg Ala Val 
150 155 160 

AAG ACT CCT ACA ACA GTG TTT GTG CAA ATC CTG AGA CTC ATC AAA CCC 643 
Lys Thr Pro Thr Thr Val Phe Val Gin He Leu Arg Leu He Lys Pro 
165 170 175 180 

ATG AAA GAC GGT ACA AGG TAT ACC GGA ATC CGA TCT CTG AAA CTT GAC 691 
Met Lys Asp Gly Thr Arg Tyr Thr Gly He Arg Ser Leu Lys Leu Asp 
185 190 195 

ATG AGC CCA GGC ACT GGT ATT TGG CAG AGT ATT GAT GTG AAG ACA GTG 739 
Met Ser Pro Gly Thr Gly He Trp Gin Ser He Asp Val Lys Thr Val 
200 205 210 

TTG CAA AAT TGG CTC AAA CAG CCT GAA TCC AAC TTA GGC ATT GAA ATC 787 
Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly He Glu He 
215 220 225 

AAA GCT TTG GAT GAG AAT GGG CAT GAT CTT GCT GTA ACC TTC CCA GGA 835 
Lys Ala Leu Asp Glu Asn Gly His Asp Leu Ala Val Thr Phe Pro Gly 
230 235 240 

CCA GGA GAA GAT GGG CTG AAT CCC TTT TTA GAA GTC AAA GTA ACA GAC 883 
Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Lys Val Thr Asp 
245 250 255 260 

ACA CCC AAG AGG TCC CGG AGA GAC TTT GGG CTT GAC TGC GAT GAA CAC 931 
Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His 
265 270 275 

TCC ACG GAA TCG CGG TGC TGT CGC TAC CCC CTC ACG GTC GAT TTC GAA 979 
Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu 
280 285 290 
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GCC TTT GGA TGG GAC TGG ATT ATT GCA CCC AAA AGA TAT AAG GCT AAT 1027 
Ala Phe Gly Trp Asp Trp lie lie Ala Pro Lys Arg Tyr Lys Ala Asn 
295 300 305 

TAC TGC TCT GGA GAG TGT GAA TTT GTG TTC TTA CAA AAA TAT CCG CAT 1075 
Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys Tyr Pro His 
310 315 320 

ACT CAT CTT GTG CAC CAA GCA AAC CCC AGA GGC TCG GCA GGC CCT TGC 1123 
Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys 
325 330 335 340 

TGC ACG CCA ACA AAA ATG TCT CCC ATT AAT ATG CTA TAT TTT AAT GGC 1171 
Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu Tyr Phe Asn Gly 
345 350 355 

AAA GAA CAA ATA ATA TAT GGG AAA ATT CCA GCC ATG GTA GTA GAC CGG 1219 
Lys Glu Gin lie lie Tyr Gly Lys lie Pro Ala Met Val Val Asp Arg 
360 365 370 

TGT GGG TGC TCG TGA 1276 
Cys Gly Cys Ser 
375 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 376 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vii) IMMEDIATE SOURCE: 
(B) CLONE: Rat GDF-8 

(ix) FEATURE: 

(A) NAME /KEY : Protein 

(B) LOCATION: 1..376 
(D) OTHER: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met lie Gin Lys 
1 



Pro 


Gin 


Met 


Tyr 


Val 


Tyr 


He 


Tyr 


Leu 


Phe 


Val 


Leu 


He 


Ala 


Ala 


Gly 


5 










10 










15 










20 


Pro 


Val 


Asp 


Leu 


Asn 


Glu 


Asp 


Ser 


Glu 


Arg 


Glu 


Ala 


Asn 


Val 


Glu 


Lys 










25 










30 










35 




Glu Gly 


Leu 


Cys 


Asn 


Ala 


Cys 


Ala 


Trp 


Arg 


Gin 


Asn 


Thr 


Arg 


Tyr 


Ser 








40 










45 










50 






Arg 


He 


Glu 


Ala 


He 


Lys 


He 


Gin 


He 


Leu 


Ser 


Lys 


Leu 


Arg 


Leu 


Glu 






55 










60 










65 








Thr 


Ala 


Pro 


Asn 


He 


Ser 


Lys 


Asp 


Ala 


He 


Arg 


Gin 


Leu 


Leu 


Pro 


Arg 




70 










75 










80 










Ala 


Pro 


Pro 


Leu 


Arg 


Glu 


Leu 


He 


Asp 


Gin 


Tyr 


Asp 


Val 


Gin 


Arg 


Asp 


85 










90 










95 










100 


Asp 


Ser 


Ser 


Asp 


Gly 


Ser 


Leu 


Glu 


Asp 


Asp 


Asp 


Tyr 


His 


Ala 


Thr 


Thr 










105 










110 










115 




Glu 


Thr 


He 


He 


Thr 


Met 


Pro 


Thr 


Glu 


Ser 


Asp 


Phe 


Leu 


Met 


Gin 


Ala 








120 










125 










130 






Asp 


Gly 


Lys 


Pro 


Lys 


Cys 


Cys 


Phe 


Phe 


Lys 


Phe 


Ser 


Ser 


Lys 


He 


Gin 






135 










140 










145 








Tyr 


Asn 


Lys 


Val 


Val 


Lys 


Ala 


Gin 


Leu 


Trp 


He 


Tyr 


Leu 


Arg 


Ala 


Val 



150 155 160 
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Lvs 


Thr 


Pro 


Thr 


Thr 


Val Phe Val 


Gin 


He 


Leu 


Arg 


Leu 


He 


Lys 


Pro 


165 










170 






175 










180 


Met 


Lys 


Asp 


Gly 


Thr 


Arg Tyr Thr 


Gly 


He 


Arg 


Ser 


Leu 


Lys 


Leu 


Asp 








185 






190 










195 




Met 


Ser 


Pro 


Gly 
200 


Thr 


Gly He Trp 


Gin 
205 


Ser 


He 


Asp 


Val 


Lys 
210 


Thr 


Val 


Leu 


Gin 


Asn 


Trp 


Leu 


Lys Gin Pro 


Glu 


Ser 


Asn 


Leu Gly 


He 


Glu 


He 






215 




220 










225 








Lys 


Ala 


Leu 


Asp 


Glu Asn Gly His 


Asp 


Leu 


Ala 


Val 


Thr 


Phe 


Pro 


Gly 


230 








235 








240 










Pro 


Gly Glu Asp 


Gly 


Leu Asn Pro 


Phe 


Leu 


Glu 


Val 


Lys 


Val 


Thr 


Asp 


245 








250 






255 










260 


Thr 


Pro 


Lys 


Arg 


Ser 


Arg Arg Asp 


Phe 


Gly 


Leu 


Asp 


Cys 


Asp 


Glu 


His 






265 






270 










275 




Ser 


Thr 


Glu 


Ser 


Arg 


Cys Cys Arg 


Tyr 


Pro 


Leu 


Thr 


Val 


Asp 


Phe 


Glu 








280 


285 










290 






Ala 


Phe 


Gly 


Trp 


Asp 


Trp He He 


Ala 


Pro 


Lys 


Arg 


Tyr 


Lys 


Ala 


Asn 






295 




300 










305 








Tyr 


Cys 


Ser 


Gly 


Glu 


Cys Glu Phe 


Val 


Phe 


Leu 


Gin 


Lys 


Tyr 


Pro 


His 


310 








315 








320 










Thr 


His 


Leu 


Val 


His 


Gin Ala Asn 


Pro 


Arg Gly 


Ser 


Ala 


Gly 


Pro 


Cys 


325 










330 






335 










340 


Cys 


Thr 


Pro 


Thr 


Lys 


Met Ser Pro 


He 


Asn 


Met 


Leu 


Tyr 


Phe 


Asn 


Gly 








345 






350 










355 




Lys 


Glu 


Gin 


He 


He 


Tyr Gly Lys 


He 


Pro 


Ala 


Met 


Val 


Val 


Asp 


Arg 






360 






365 










370 






Cys 


Gly 


Cys 


Ser 























375 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1055 base pairs 

- (B) TYPE: nucleic acid 
(C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE : Genomic DNA 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Turkey GDF-8 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1...1055 
(D) OTHER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

ATG CAA AAG 9 
Met Gin Lys 
1 

CTA GCA GTC TAT GTT TAT ATT TAC CTG TTC ATG CAG ATT TTA GTT CAT 54 
Leu Ala Val Tyr Val Tyr He Tyr Leu Phe Met Gin He Leu Val His 
5 10 15 20 

CCG GTG GCT CTT GAT GGC AGT AGT CAG CCC ACA GAG AAC GCT GAA AAA 95 
Pro Val Ala Leu Asp Gly Ser Ser Glu Gin Lys Glu Asn Val Glu Lys 
25 30 35 

GAC GGA CTG TGC AAT GCT TGC ACG TGG AGA CAG AAT ACT AAA TCC TCC 140 
Glu Gly Leu Cys Asn Ala Cys Thr Trp Arg Gin Asn Thr Lys Ser Ser 
40 45 50 
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AGA ATA GAA GCC ATA AAA ATT CAA ATC CTC AGC AAA CTG CGC CTG GAA 185 
Arg lie Glu Ala lie Lys lie Gin lie Leu Ser Lys Leu Arg Leu Glu 
55 60 65 

CAA GCA CCT AAC ATT AGC AGG GAC GTT ATT AAA CAA CTT TTA CCC AAA 23 0 

Gin Ala Pro Asn lie Ser Arg Asp Val lie Lys Gin Leu Leu Pro Lys 
70 75 80 

GCT CCT CCG CTG CAG GAA CTG ATT GAT CAG TAT GAC GTC CAG AGA GAC 275 
Ala Pro Pro Leu Gin Glu Leu lie Asp Gin Tyr Asp Val Gin Arg Asp 
85 90 95 100 

GAC AGT AGC GAT GGC TCT TTG GAA GAC GAT GAC TAT CAT GCC ACA ACC 320 
Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Thr Thr 
105 110 115 

GAA ACG ATT ATC ACA ATG CCT ACG GAG TCT GAT TTT CTT GTA CAA ATG 3 65 

Glu Thr lie lie Thr Met Pro Thr Glu Ser Asp Phe Leu Val Gin Met 
120 125 130 

GAG GGA AAA CCA AAA TGT TGC TTC TTT AAG TTT AGC TCT AAA ATA CAA 410 
Glu Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser Lys lie Gin 
135 140 145 

TAT AAC AAA GTA GTA AAG GCA CAA TTA TGG ATA TAG TTG AGG CAA GTC 4 55 

Tyr Asn Lys Val Val Lys Ala Gin Leu Trp lie Tyr Leu Arg Gin Val 
150 155 160 

CAA AAA CCT ACA ACG GTG TTT GTG CAG ATC CTG AGA CTC ATT AAA CCC 50 0 

Gin Lys Pro Thr Thr Val Phe Val Gin lie Leu Arg Leu lie Lys Pro 
165 170 175 180 

ATG AAA GAC GGT ACA AGA TAT ACT GGA ATT CGA TCT TTG AAA CTT GAC 545 
Met Lys Asp Gly Thr Arg Tyr Thr Gly lie Arg Ser Leu Lys Leu Asp 
185 190 195 

ATG AAC CCA GGC ACT GGT ATC TGG CAG AGT ATT GAT GTG AAG ACA GTG 590 
Met Ser Pro Gly Thr Gly lie Trp Gin Ser lie Asp Val Lys Thr Val 
200 205 210 

TTG CAA AAT TGG CTC AAA CAG CCT GAA TCC AAT TTA GGC ATC GAA ATA 635 
Leu Gin Asn Trp Leu Lys Gin Pro Glu Ser Asn Leu Gly lie Glu lie 
215 220 225 

AAA GCT TTT GAT GAG AAT GGA CGA GAT CTT GCT GTA ACA TTC CCA GGA 680 
Lys Ala Phe Asp Glu Asn Gly Arg Asp Leu Ala Val Thr Phe Pro Gly 
230 235 240 

CCA GGT GAA GAT GGA CTG AAC CCA TTT TTA GAG GTC AGA GTT ACA GAC 725 
Pro Gly Glu Asp Gly Leu Asn Pro Phe Leu Glu Val Arg Val Thr Asp 
245 250 255 260 

ACA CCA AAA CGG TCC CGC AGA GAT TTT GGC CTT GAC TGC GAC GAG CAC 770 
Thr Pro Lys Arg Ser Arg Arg Asp Phe Gly Leu Asp Cys Asp Glu His 
265 270 275 

TCA ACG GAA TCT CGA TGT TGT CGC TAG CCG CTG ACA GTG GAT TTT GAA 815 
Ser Thr Glu Ser Arg Cys Cys Arg Tyr Pro Leu Thr Val Asp Phe Glu 
280 285 290 

GCT TTT GGA TGG GAC TGG ATT ATA GCA CCT AAA AGA TAC AAA GCC AAT 860 
Ala Phe Gly Trp Asp Trp lie lie Ala Pro Lys Arg Tyr Lys Ala Asn 
295 300 305 
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TAC TGC TCT GGA GAA TGT GAA TTC GTA TTT CTA CAG AAA TAC CCG CAC 905 
Tyr Cys Ser Gly Glu Cys Glu Phe Val Phe Leu Gin Lys Tyr Pro His 
310 315 320 

ACT CAC CTG GTA CAC CAA GCA AAT CCA AGA GGC TCA GCA GGC CCT TGC 950 
Thr His Leu Val His Gin Ala Asn Pro Arg Gly Ser Ala Gly Pro Cys 
325 330 335 340 

TGC ACA CCC ACC AAG ATG TCC CCT ATA AAC ATG CTG TAT TTC AAT GGA 995 
Cys Thr Pro Thr Lys Met Ser Pro lie Asn Met Leu Tyr Phe Asn Gly 
345 350 355 

AAA GAA CAA ATA ATA TAT GGA AAG ATA CCA GCC ATG GTT GTA GAT CGT 104 0 
Lys Glu Gin He He Tyr Gly Lys He Pro Ala Met Val Val Asp Arg 
360 365 370 

TGC GGG TGC TCA TGA 1055 
Cys Gly Cys Ser 
375 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 376 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Turkey GDF-8 

(ix) FEATURE : 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..376 
(D) OTHER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Met Gin Lys 
1 

Leu Ala Val Tyr Val Tyr He Tyr Leu Phe Met Gin He Leu Val His 
5 io 15 20 

Pro Val Ala Leu Asp Gly Ser Ser Glu Gin Lys Glu Asn Val Glu Lys 

25 30 35 

Glu Gly Leu Cys Asn Ala Cys Thr Trp Arg Gin Asn Thr Lys Ser Ser 

40 45 50 

Arg He Glu Ala He Lys He Gin He Leu Ser Lys Leu Arg Leu Glu 

55 60 65 

Gin Ala Pro Asn He Ser Arg Asp Val He Lys Gin Leu Leu Pro Lys 

70 75 80 

Ala Pro Pro Leu Gin Glu Leu He Asp Gin Tyr Asp Val Gin Arg Asp 
85 90 95 100 

Asp Ser Ser Asp Gly Ser Leu Glu Asp Asp Asp Tyr His Ala Thr Thr 

105 HO H5 

Glu Thr He He Thr Met Pro Thr Glu Ser Asp Phe Leu Val Gin Met 

120 125 130 

Glu Gly Lys Pro Lys Cys Cys Phe Phe Lys Phe Ser Ser Lys He Gin 

135 140 145 

Tyr Asn Lys Val Val Lys Ala Gin Leu Trp He Tyr Leu Arg Gin Val 

150 155 160 

Gin Lys Pro Thr Thr Val Phe Val Gin He Leu Arg Leu He Lys Pro 
165 170 175 180 

Met Lys Asp Gly Thr Arg Tyr Thr Gly He Arg Ser Leu Lys Leu Asp 

185 190 195 

Met Ser Pro Gly Thr Gly He Trp Gin Ser He Asp Val Lys Thr Val 
200 205 210 
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Leu 


Gin 


Asn 
215 


Trp 


Leu 


Lys 


Gin 


Pro 
220 


Glu 


Ser 


Asn 


Leu 


Gly 
225 


He 


Glu 


He 


Lys 


Ala 


Phe 


Asp 


Glu 


Asn 


Gly Arg Asp 


Leu 


Ala 


Val 


Thr 


Phe 


Pro 


Gly 




230 










235 










240 










Pro Gly Glu 


Asp 


Gly 


Leu 


Asn 


Pro 


Phe 


Leu 


Glu 


Val 


Arg 


Val 


Thr 


Asp 


245 










250 










255 










260 


Thr 


Pro 


Lys 


Arg 


Ser 
265 


Arg 


Arg 


Asp 


Phe 


Gly 
270 


Leu 


Asp 


Cys 


Asp 


Glu 
275 


His 


Ser 


Thr 


Glu 


Ser 
280 


Arg 


Cys 


Cys 


Arg 


Tyr 
285 


Pro 


Leu 


TViv 

i xir 


val 


Asp 
290 


Phe 


Glu 


Ala 


Phe Gly 


Trp 


Asp 


Trp 


He 


He 


Ala 


Pro 


Lys 


Arg 


Tyr 


Lys 


Ala 


Asn 






295 










300 










305 








Tyr 


Cys 


Ser 


Gly Glu Cys 


Glu 


Phe 


Val 


Phe 


Leu 


Gin 


Lys 


Tyr 


Pro 


His 


310 










315 










320 










Thr 


His 


Leu 


Val 


His 


Gin 


Ala 


Asn 


Pro 


Arg 


Gly 


Ser 


Ala 


Gly 


Pro 


Cys 


325 










330 










335 










340 


Cys 


Thr 


Pro 


Thr 


Lys 


Met 


Ser 


Pro 


He 


Asn 


Met 


Leu 


Tyr 


Phe 


Asn Gly 








345 










350 










355 




Lys 


Glu 


Gin 


He 


He 


Tyr 


Gly 


Lys 


He 


Pro 


Ala 


Met 


Val 


Val 


Asp 


Arg 



360 365 370 

Cys Gly Cys Ser 
375 
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